5' ESTS AND ENCODED HUMAN PROTEINS

Background of the Invention 
The estimated 50,000-100,000 genes scattered along the human chromosomes offer tremendous 
promise for the understanding, diagnosis, and treatment of human diseases. In addition, probes capable
of specifically hybridizing to loci distributed throughout the human genome find applications in the 
construction of high resolution chromosome maps and in the identification of individuals. 

In the past, the characterization of even a single human gene was a painstaking process, 
requiring years of effort. Recent developments in the areas of cloning vectors, DNA sequencing, and
computer technology have merged to greatly accelerate the rate at which human genes can be isolated,
sequenced, mapped, and characterized. 

Currently, two different approaches are being pursued for identifying and characterizing the 
genes distributed along the human genome. In one approach, large fragments of genomic DNA are 
isolated, cloned, and sequenced. Potential open reading frames in these genomic sequences are
identified using bioinformatics software. However, this approach entails sequencing large stretches
of human DNA which do not encode proteins in order to find the protein encoding sequences 
scattered throughout the genome. In addition to requiring extensive sequencing, the bioinformatics 
software may mischaracterize the genomic sequences obtained, i.e., labeling non-coding DNA as 
coding DNA and vice versa. 

An alternative approach takes a more direct route to identifying and characterizing human
genes. In this approach, complementary DNAs (cDNAs) are synthesized from isolated messenger 
RNAs (mRNAs) which encode human proteins. Using this approach, sequencing is only performed on 
DNA which is derived from protein coding portions of the genome. Often, only short stretches of the 
cDNAs are sequenced to obtain sequences called expressed sequence tags (ESTs). The ESTs may then
be used to isolate or purify extended cDNAs which include sequences adjacent to the EST sequences.
The extended cDNAs may contain all of the sequence of the EST which was used to obtain them or only 
a portion of the sequence of the EST which was used to obtain them. In addition, the extended cDNAs 
may contain the full coding sequence of the gene from which the EST was derived or, alternatively, the 
extended cDNAs may include portions of the coding sequence of the gene from which the EST was
derived. It will be appreciated that there may be several extended cDNAs which include the EST
sequence as a result of alternative splicing or the activity of alternative promoters. Alternatively, ESTs
having partially overlapping sequences may be identified and contigs comprising the consensus 
sequences of the overlapping ESTs may be identified. 

In the past, these short EST sequences were often obtained from oligo-dT primed cDNA
libraries. Accordingly, they mainly corresponded to the 3' untranslated region of the mRNA. In part,
the prevalence of EST sequences derived from the 3' end of the mRNA is a result of the fact that typical
techniques for obtaining cDNAs are not well suited for isolating cDNA sequences derived from the 5'
ends of mRNAs (Adams et al., Nature 377:3-174, 1996; Hillier et al., Genome Res. 6:807-828, 1996).

In addition, in those reported instances where longer cDNA sequences have been obtained, the 
reported sequences typically correspond to coding sequences and do not include the full 5' untranslated 
region (5'UTR) of the mRNA from which the cDNA is derived. Indeed, 5'UTRs have been shown to
affect either the stability or translation of mRNAs. Thus, regulation of gene expression may be achieved 
through the use of alternative 5'UTRs as shown, for instance, for the translation of the tissue inhibitor 
of metalloprotease mRNA in mitogenically activated cells (Waterhouse et al., J Biol Chem. 265:5585-9,
1990). Furthermore, modification of 5'UTR through mutation, insertion or translocation events
may even be implicated in pathogenesis. For instance, the fragile X syndrome, the most common cause
of inherited mental retardation, is partly due to an insertion of multiple CGG trinucleotides in the 
5'UTR of the fragile X mRNA resulting in the inhibition of protein synthesis via ribosome stalling 
(Feng et al., Science 268:731-4, 1995). An aberrant mutation in regions of the 5'UTR known to
inhibit translation of the proto-oncogene c-myc was shown to result in upregulation of c-myc protein
levels in cells derived from patients with multiple myelomas (Willis et al., Curr Top Microbiol
Immunol 224:269-76, 1997). In addition, the use of oligo-dT primed cDNA libraries does not allow the
isolation of complete 5'UTRs since such incomplete sequences obtained by this process may not include 
the first exon of the mRNA, particularly in situations where the first exon is short. Furthermore, they 
may not include some exons, often short ones, which are located upstream of splicing sites. Thus, there
is a need to obtain sequences derived from the 5' ends of mRNAs.

While many sequences derived from human chromosomes have practical applications, 
approaches based on the identification and characterization of those chromosomal sequences which 
encode a protein product are particularly relevant to diagnostic and therapeutic uses. In some instances,
the sequences used in such therapeutic or diagnostic techniques may be sequences which encode proteins
which are secreted from the cell in which they are synthesized. Those sequences encoding secreted
proteins as well as the secreted proteins themselves, are particularly valuable as potential therapeutic 
agents. Such proteins are often involved in cell to cell communication and may be responsible for 
producing a clinically relevant response in their target cells. In fact, several secretory proteins, including 
tissue plasminogen activator, G-CSF, GM-CSF, erythropoietin, human growth hormone, insulin,
interferon-α, interferon-β, interferon-γ, and interleukin-2, are currently in clinical use. These proteins
are used to treat a wide range of conditions, including acute myocardial infarction, acute ischemic stroke, 
anemia, diabetes, growth hormone deficiency, hepatitis, kidney carcinoma, chemotherapy-induced 
neutropenia and multiple sclerosis. For these reasons, extended cDNAs encoding secreted proteins or 
portions thereof represent a valuable source of therapeutic agents. Thus, there is a need for the
identification and characterization of secreted proteins and the nucleic acids encoding them.

In addition to being therapeutically useful themselves, secretory proteins include short peptides, 
called signal peptides, at their amino termini which direct their secretion. These signal peptides are 
encoded by the signal sequences located at the 5' ends of the coding sequences of genes encoding
secreted proteins. These signal peptides can be used to direct the extracellular secretion of any protein to
which they are operably linked. In addition, portions of the signal peptides, called membrane-translocating
sequences, may also be used to direct the intracellular import of a peptide or protein of
interest. This may prove beneficial in gene therapy strategies in which it is desired to deliver a particular
gene product to cells other than the cells in which it is produced. Signal sequences encoding signal 
peptides also find application in simplifying protein purification techniques. In such applications, the 
extracellular secretion of the desired protein greatly facilitates purification by reducing the number of 
undesired proteins from which the desired protein must be selected. Thus, there exists a need to identify
and characterize the 5' portions of the genes for secretory proteins which encode signal peptides.

Sequences coding for non-secreted proteins may also find application as therapeutics or 
diagnostics. In particular, such sequences may be used to determine whether an individual is likely to 
express a detectable phenotype, such as a disease, as a consequence of a mutation in the coding sequence 
of a protein. In instances where the individual is at risk of suffering from a disease or other undesirable
phenotype as a result of a mutation in such a coding sequence, the undesirable phenotype may be
corrected by introducing a normal coding sequence using gene therapy. Alternatively, if the undesirable 
phenotype results from overexpression of the protein encoded by the coding sequence, expression of the 
protein may be reduced using antisense or triple helix based strategies. 

The secreted or non-secreted human polypeptides encoded by the coding sequences may also be
used as therapeutics by administering them directly to an individual having a condition, such as a
disease, resulting from a mutation in the sequence encoding the polypeptide. In such an instance, the 
condition can be cured or ameliorated by administering the polypeptide to the individual. 

In addition, the secreted or non-secreted human polypeptides or portions thereof may be used to 
generate antibodies useful in determining the tissue type or species of origin of a biological sample. The
antibodies may also be used to determine the cellular localization of the secreted or non-secreted human
polypeptides or the cellular localization of polypeptides which have been fused to the human 
polypeptides. In addition, the antibodies may also be used in immunoaffinity chromatography 
techniques to isolate, purify, or enrich the human polypeptide or a target polypeptide which has been 
fused to the human polypeptide.

Public information on the number of human genes for which the promoters and upstream
regulatory regions have been identified and characterized is quite limited. In part, this may be due to the 
difficulty of isolating such regulatory sequences. Upstream regulatory sequences such as transcription 
factor binding sites are typically too short to be utilized as probes for isolating promoters from human 
genomic libraries. Recently, some approaches have been developed to isolate human promoters. One of
them consists of making a CpG island library (Cross et al., Nature Genetics 6:236-244, 1994). The
second consists of isolating human genomic DNA sequences containing SpeI binding sites by the use of
SpeI binding protein (Mortlock et al., Genome Res. 6:327-335, 1996). Both of these approaches have
their limits due to a lack of specificity and of comprehensiveness. Thus, there exists a need to identify
and systematically characterize the 5' portions of the genes. 

The present 5' ESTs may be used to efficiently identify and isolate 5'UTRs and upstream 
regulatory regions which control the location, developmental stage, rate, and quantity of protein 
synthesis, as well as the stability of the mRNA. Once identified and characterized, these regulatory
regions may be utilized in gene therapy or protein purification schemes to obtain the desired amount and 
locations of protein synthesis or to inhibit, reduce, or prevent the synthesis of undesirable gene products. 

In addition, ESTs containing the 5' ends of protein genes may include sequences useful as 
probes for chromosome mapping and the identification of individuals. Thus, there is a need to identify 
and characterize the sequences upstream of the 5' coding sequences of genes.

Summary of the Invention 
The present invention relates to purified, isolated, or enriched 5' ESTs which include sequences 
derived from the authentic 5' ends of their corresponding mRNAs. The term "corresponding mRNA"
refers to the mRNA which was the template for the cDNA synthesis which produced the 5' EST. These
sequences will be referred to hereinafter as "5' ESTs." The present invention also includes purified,
isolated or enriched nucleic acids comprising contigs assembled by determining a consensus sequence
from a plurality of ESTs containing overlapping sequences. These contigs will be referred to herein as
"consensus contigated 5' ESTs."

As used herein, the term "purified" does not require absolute purity; rather, it is intended as a
relative definition. Individual 5' EST clones isolated from a cDNA library have been conventionally 
purified to electrophoretic homogeneity. The sequences obtained from these clones could not be
obtained directly either from the library or from total human DNA. The cDNA clones are not naturally 
occurring as such, but rather are obtained via manipulation of a partially purified naturally occurring
substance (messenger RNA). The conversion of mRNA into a cDNA library involves the creation of a
synthetic substance (cDNA) and pure individual cDNA clones can be isolated from the synthetic library
by clonal selection. Thus, creating a cDNA library from messenger RNA and subsequently isolating
individual clones from that library results in an approximately 10^4-10^6 fold purification of the native
message. Purification of starting material or natural material to at least one order of magnitude,
preferably two or three orders, and more preferably four or five orders of magnitude is expressly
contemplated. 

As used herein, the term "isolated" requires that the material be removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring
polynucleotide present in a living animal is not isolated, but the same polynucleotide,
separated from some or all of the coexisting materials in the natural system, is isolated.

As used herein, the term "recombinant" means that the 5' EST is adjacent to "backbone" nucleic 
acid to which it is not adjacent in its natural environment. Additionally, to be "enriched" the 5' ESTs will 
represent 5% or more of the number of nucleic acid inserts in a population of nucleic acid backbone
molecules. Backbone molecules according to the present invention include nucleic acids such as 
expression vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, and other vectors or 
nucleic acids used to maintain or manipulate a nucleic acid insert of interest. Preferably, the enriched 5' 
ESTs represent 15% or more of the number of nucleic acid inserts in the population of recombinant
backbone molecules. More preferably, the enriched 5' ESTs represent 50% or more of the number of 
nucleic acid inserts in the population of recombinant backbone molecules. In a highly preferred 
embodiment, the enriched 5' ESTs represent 90% or more of the number of nucleic acid inserts in the 
population of recombinant backbone molecules.

"Stringent," "moderate," and "low" hybridization conditions are as defined below.

The term "polypeptide" refers to a polymer of amino acids without regard to the length of the 
polymer; thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. 
This term also does not specify or exclude post-expression modifications of polypeptides, for 
example, polypeptides which include the covalent attachment of glycosyl groups, acetyl groups,
phosphate groups, lipid groups and the like are expressly encompassed by the term polypeptide.
Also included within the definition are polypeptides which contain one or more analogs of an amino 
acid (including, for example, non-naturally occurring amino acids, amino acids which only occur 
naturally in an unrelated biological system, modified amino acids from mammalian systems etc.), 
polypeptides with substituted linkages, as well as other modifications known in the art, both naturally
occurring and non-naturally occurring.

As used interchangeably herein, the terms "nucleic acids," "oligonucleotides," and 
"polynucleotides" include RNA, DNA, or RNA/DNA hybrid sequences of more than one nucleotide 
in either single chain or duplex form. The term "nucleotide" is used herein as an adjective to
describe molecules comprising RNA, DNA, or RNA/DNA hybrid sequences of any length in single-stranded
or duplex form. The term "nucleotide" is also used herein as a noun to refer to individual
nucleotides or varieties of nucleotides, meaning a molecule, or individual unit in a larger nucleic acid 
molecule, comprising a purine or pyrimidine, a ribose or deoxyribose sugar moiety, and a phosphate 
group, or phosphodiester linkage in the case of nucleotides within an oligonucleotide or 
polynucleotide. The term "nucleotide" is also used herein to encompass "modified
nucleotides" which comprise at least one of the following modifications: (a) an alternative linking group, (b) an
analogous form of purine, (c) an analogous form of pyrimidine, or (d) an analogous sugar. For
examples of analogous linking groups, purines, pyrimidines, and sugars see, for example, PCT
publication No. WO 95/04064. The polynucleotide sequences of the invention may be prepared by
any known method, including synthetic, recombinant, ex vivo generation, or a combination thereof,
as well as utilizing any purification methods known in the art.

The terms "base paired" and "Watson & Crick base paired" are used interchangeably herein 
to refer to nucleotides which can be hydrogen bonded to one another by virtue of their sequence
identities in a manner like that found in double-helical DNA with thymine or uracil residues linked to
adenine residues by two hydrogen bonds and cytosine and guanine residues linked by three hydrogen
bonds (See Stryer, L., Biochemistry, 4th edition, 1995).

The terms "complementary" or "complement thereof" are used herein to refer to the
sequences of polynucleotides which are capable of forming Watson & Crick base pairing with
another specified polynucleotide throughout the entirety of the complementary region. For the 
purpose of the present invention, a first polynucleotide is deemed to be complementary to a second 
polynucleotide when each base in the first polynucleotide is paired with its complementary base. 
Complementary bases are, generally, A and T (or A and U), or C and G. "Complement" is used
herein as a synonym for "complementary polynucleotide," "complementary nucleic acid" and
"complementary nucleotide sequence." These terms are applied to pairs of polynucleotides based
solely upon their sequences and not any particular set of conditions under which the two
polynucleotides would actually bind. Preferably, a "complementary" sequence is a sequence which has an
A at each position where there is a T on the opposite strand, a T at each position where there is an A on
the opposite strand, a G at each position where there is a C on the opposite strand and a C at each
position where there is a G on the opposite strand.
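
The sequence-only definition of complementarity given above can be illustrated with a minimal Python sketch. The function names are hypothetical and chosen for illustration only; the sketch assumes plain DNA strings written 5' to 3', covers only the A/T and C/G pairings (not A/U or modified nucleotides), and treats the two strands as antiparallel.

```python
# Illustrative sketch only: names are hypothetical, DNA alphabet A/C/G/T assumed.
DNA_COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement(seq):
    """Return the base-by-base complement, read in the same left-to-right order."""
    return "".join(DNA_COMPLEMENT[base] for base in seq.upper())


def reverse_complement(seq):
    """Return the complementary strand written 5' to 3'."""
    return complement(seq)[::-1]


def is_complementary(first, second):
    """True when every base of `first` faces its complementary base on `second`.

    Both strands are given 5' to 3' and are assumed to pair antiparallel over
    their entire length, so the test depends on sequence alone.
    """
    return len(first) == len(second) and reverse_complement(first) == second.upper()
```

As in the definition, the check says nothing about whether the two strands would actually hybridize under any particular set of conditions.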

Thus, 5' ESTs in cDNA libraries in which one or more 5' ESTs make up 5% or more of the 
number of nucleic acid inserts in the backbone molecules are "enriched recombinant 5' ESTs" as defined
herein. Likewise, 5' ESTs in a population of plasmids in which one or more 5' ESTs of the present
invention have been inserted such that they represent 5% or more of the number of inserts in the plasmid
backbone are "enriched recombinant 5' ESTs" as defined herein. However, 5' ESTs in cDNA libraries in
which 5' ESTs constitute less than 5% of the number of nucleic acid inserts in the population of
backbone molecules, such as libraries in which backbone molecules having a 5' EST insert are 
extremely rare, are not "enriched recombinant 5' ESTs." 
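
The enrichment thresholds defined above (5%, 15%, 50%, and 90% of the nucleic acid inserts) can be expressed as a short Python sketch. The function name and the returned labels are hypothetical; only the percentage cut-offs come from the text. Integer arithmetic is used so that a population sitting exactly on a threshold is classified without floating-point rounding.

```python
def enrichment_level(est_inserts, total_inserts):
    """Classify a population of backbone molecules by its share of 5' EST inserts.

    `est_inserts` is the number of inserts that are 5' ESTs and `total_inserts`
    is the number of all inserts. Labels are illustrative, not terms of art.
    """
    if total_inserts <= 0 or not 0 <= est_inserts <= total_inserts:
        raise ValueError("counts must satisfy 0 <= est_inserts <= total_inserts, total > 0")
    percent_times_total = est_inserts * 100  # compare against threshold * total
    if percent_times_total >= 90 * total_inserts:
        return "enriched (highly preferred, >= 90%)"
    if percent_times_total >= 50 * total_inserts:
        return "enriched (more preferred, >= 50%)"
    if percent_times_total >= 15 * total_inserts:
        return "enriched (preferred, >= 15%)"
    if percent_times_total >= 5 * total_inserts:
        return "enriched (>= 5%)"
    return "not enriched (< 5%)"
```

For example, a library in which 4 of 100 inserts are 5' ESTs falls below the 5% floor and is therefore not "enriched" in the sense defined above.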

In some embodiments, the present invention relates to 5' ESTs which are derived from genes
encoding secreted proteins. As used herein, a "secreted" protein is one which, when expressed in a 
suitable host cell, is transported across or through a membrane, including transport as a result of signal 
peptides in its amino acid sequence. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g. soluble proteins), or partially (e.g. receptors) from the cell in which they are expressed.
"Secreted" proteins also include without limitation proteins which are transported across the membrane
of the endoplasmic reticulum. 

Such 5' ESTs include nucleic acid sequences, called signal sequences, which encode signal 
peptides which direct the extracellular secretion of the proteins encoded by the genes from which the 5' 
ESTs are derived. Generally, the signal peptides are located at the amino termini of secreted proteins. 

Secreted proteins are translated by ribosomes associated with the "rough" endoplasmic
reticulum. Generally, secreted proteins are co-translationally transferred to the membrane of the 
endoplasmic reticulum. Association of the ribosome with the endoplasmic reticulum during translation 
of secreted proteins is mediated by the signal peptide. The signal peptide is typically cleaved following
its co-translational entry into the endoplasmic reticulum. After delivery to the endoplasmic reticulum, 
secreted proteins may proceed through the Golgi apparatus. In the Golgi apparatus, the proteins may 
undergo post-translational modification before entering secretory vesicles which transport them across 
the cell membrane.

The 5' ESTs of the present invention have several important applications. For example, they 
may be used to obtain and express cDNA clones which include the full protein coding sequences of the
corresponding gene products, including the authentic translation start sites derived from the 5' ends of
the coding sequences of the mRNAs from which the 5' ESTs are derived. These cDNAs will be referred
to hereinafter as "full-length cDNAs." These cDNAs may also include DNA derived from mRNA
sequences upstream of the translation start site. The full-length cDNA sequences may be used to express
the proteins corresponding to the 5' ESTs. As discussed above, secreted proteins and non-secreted 
proteins may be therapeutically important. Thus, the proteins expressed from the cDNAs may be useful 
in treating and controlling a variety of human conditions. The 5' ESTs may also be used to obtain the
corresponding genomic DNA. The term "corresponding genomic DNA" refers to the genomic DNA
which encodes the mRNA from which the 5' EST was derived. 

Alternatively, the 5' ESTs may be used to obtain and express extended cDNAs encoding 
portions of the protein. In the case of secreted proteins, the portions may comprise the signal peptides of 
the secreted proteins or the mature proteins generated when the signal peptide is cleaved off. 

The present invention includes isolated, purified, or enriched "EST-related nucleic acids." The
terms "isolated," "purified" or "enriched" have the meanings provided above. As used herein, the term
"EST-related nucleic acids" means the nucleic acids of SEQ ID NOs. 24-811 and 1600-1622, extended
cDNAs obtainable using the nucleic acids of SEQ ID NOs. 24-811 and 1600-1622, full-length cDNAs
obtainable using the nucleic acids of SEQ ID NOs. 24-811 and 1600-1622 or genomic DNAs obtainable
using the nucleic acids of SEQ ID NOs. 24-811 and 1600-1622. The present invention also includes the
sequences complementary to the EST-related nucleic acids. 

The present invention also includes isolated, purified, or enriched "fragments of EST-related
nucleic acids." The terms "isolated," "purified" and "enriched" have the meanings described above. As
used herein the term "fragments of EST-related nucleic acids" means fragments comprising at least 10,
12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 50, 75, 100, 200, 300, 500, or 1000 consecutive nucleotides of the
EST-related nucleic acids to the extent that fragments of these lengths are consistent with the lengths of
the particular EST-related nucleic acids being referenced. In particular, fragments of EST-related
nucleic acids refer to "polynucleotides described in Table II," "polynucleotides described in Table III,"
and "polynucleotides described in Table IV." The present invention also includes the sequences
complementary to the fragments of the EST-related nucleic acids.

The present invention also includes isolated, purified, or enriched "positional segments of EST-related
nucleic acids." As used herein, the term "positional segments of EST-related nucleic acids"
includes segments comprising nucleotides 1-25, 26-50, 51-75, 76-100, 101-125, 126-150, 151-175, 176-200,
201-225, 226-250, 251-300, 301-325, 326-350, 351-375, 376-400, 401-425, 426-450, 451-475,
476-500, 501-525, 526-550, 551-575, 576-600 and 601-the terminal nucleotide of the EST-related
nucleic acids to the extent that such nucleotide positions are consistent with the lengths of the particular
EST-related nucleic acids being referenced. The term "positional segments of EST-related nucleic acids"
also includes segments comprising nucleotides 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350,
351-400, 401-450, 450-500, 501-550, 551-600 or 601-the terminal nucleotide of the EST-related
nucleic acids to the extent that such nucleotide positions are consistent with the lengths of the particular
EST-related nucleic acids being referenced. The term "positional segments of EST-related nucleic
acids" also includes segments comprising nucleotides 1-100, 101-200, 201-300, 301-400, 401-500, 500-600,
or 601-the terminal nucleotide of the EST-related nucleic acids to the extent that such nucleotide
positions are consistent with the lengths of the particular EST-related nucleic acids being referenced. In
addition, the term "positional segments of EST-related nucleic acids" includes segments comprising
nucleotides 1-200, 201-400, 400-600, or 601-the terminal nucleotide of the EST-related nucleic acids to
the extent that such nucleotide positions are consistent with the lengths of the particular EST-related
nucleic acids being referenced. The present invention also includes the sequences complementary to the
positional segments of EST-related nucleic acids. 
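
The positional segments described above can be enumerated with a short Python sketch. It is an illustration under stated assumptions, not part of the disclosure: the function name and parameters are hypothetical, positions are 1-based and inclusive as in the text, the tiling is strictly regular (which simplifies the listed ranges), and a range is treated as "consistent with the length" of a sequence only if it fits entirely within that sequence.

```python
def positional_segments(seq, width, last_fixed_end=600):
    """List (start, end, subsequence) tuples using 1-based inclusive positions.

    Fixed-width segments tile positions 1..last_fixed_end; one final segment
    runs from last_fixed_end + 1 to the terminal nucleotide when the sequence
    is long enough. Ranges that extend past the end of `seq` are skipped.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    n = len(seq)
    segments = []
    start = 1
    while True:
        end = start + width - 1
        if end > last_fixed_end or end > n:
            break
        segments.append((start, end, seq[start - 1:end]))
        start = end + 1
    if n > last_fixed_end:
        segments.append((last_fixed_end + 1, n, seq[last_fixed_end:]))
    return segments
```

For a 130-nucleotide sequence and a width of 25, the sketch yields the segments 1-25 through 101-125 and omits 126-150, which would run past the end of the sequence.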

The present invention also includes isolated, purified, or enriched "fragments of positional 
segments of EST-related nucleic acids." As used herein, the term "fragments of positional segments of
EST-related nucleic acids" refers to fragments comprising at least 10, 15, 18, 20, 23, 25, 28, 30, 35, 40,
50, 75, 100, 150, or 200 consecutive nucleotides of the positional segments of EST-related nucleic acids. 
The present invention also includes the sequences complementary to the fragments of positional 
segments of EST-related nucleic acids. 
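
As a hedged illustration of "at least N consecutive nucleotides," the following Python sketch lists every window of exactly N consecutive nucleotides within a segment; any longer fragment necessarily contains one of these windows. The function name is hypothetical, and the example sequence in the usage note is invented for illustration.

```python
def consecutive_fragments(segment, length):
    """Return every run of exactly `length` consecutive nucleotides in `segment`.

    An empty list is returned when the segment is shorter than `length`,
    i.e. when that fragment length is not consistent with the segment length.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    return [segment[i:i + length] for i in range(len(segment) - length + 1)]
```

For the made-up 12-nucleotide segment "ACGTACGTACGT" and a length of 10, the sketch returns three fragments, starting at positions 1, 2, and 3.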

The present invention also includes isolated or purified "EST-related polypeptides." As used
herein, the term "EST-related polypeptides" means the polypeptides encoded by the EST-related nucleic
acids, including the polypeptides of SEQ ID NOs. 812-1599. 

The present invention also includes isolated or purified "fragments of EST-related 
polypeptides." As used herein, the term "fragments of EST-related polypeptides" means fragments 
comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of an EST-related
polypeptide to the extent that fragments of these lengths are consistent with the lengths of the
particular EST-related polypeptides being referenced. In particular, fragments of EST-related
polypeptides refer to polypeptides encoded by "polynucleotides described in Table II," "polynucleotides
described in Table III," and "polynucleotides described in Table IV." 

The present invention also includes isolated or purified "positional segments of EST-related
polypeptides." As used herein, the term "positional segments of EST-related polypeptides" includes
polypeptides comprising amino acid residues 1-25, 26-50, 51-75, 76-100, 101-125, 126-150, 151-175,
176-200, or 201-the C-terminal amino acid of the EST-related polypeptides to the extent that such amino
acid residues are consistent with the lengths of the particular EST-related polypeptides being referenced.
The term "positional segments of EST-related polypeptides" also includes segments comprising amino
acid residues 1-50, 51-100, 101-150, 151-200 or 201-the C-terminal amino acid of the EST-related
polypeptides to the extent that such amino acid residues are consistent with the lengths of the particular
EST-related polypeptides being referenced. The term "positional segments of EST-related polypeptides"
also includes segments comprising amino acids 1-100 or 101-200 of the EST-related polypeptides to the 
extent that such amino acid residues are consistent with the lengths of particular EST-related 
polypeptides being referenced. In addition, the term "positional segments of EST-related polypeptides" 
includes segments comprising amino acid residues 1-200 or 201-the C-terminal amino acid of the EST-related
polypeptides to the extent that such amino acid residues are consistent with the lengths of the
particular EST-related polypeptides being referenced. 

The present invention also includes isolated or purified "fragments of positional segments of 
EST-related polypeptides." As used herein, the term "fragments of positional segments of EST-related 
polypeptides" means fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150
consecutive amino acids of positional segments of EST-related polypeptides to the extent that fragments
of these lengths are consistent with the lengths of the particular EST-related polypeptides being
referenced. 

The present invention also includes antibodies which specifically recognize the EST-related 
polypeptides, fragments of EST-related polypeptides, positional segments of EST-related polypeptides, 
or fragments of positional segments of EST-related polypeptides. In the case of secreted proteins, such
as those of SEQ ID NOs. 1554-1580, antibodies which specifically recognize the mature protein
generated when the signal peptide is cleaved may also be obtained as described below. Similarly, 
antibodies which specifically recognize the signal peptides of SEQ ID NOs. 812-1516 or 1554-1580 may 
also be obtained. 

In some embodiments and in the case of secreted proteins, the EST-related nucleic acids,
fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments 
of positional segments of nucleic acids include a signal sequence. In other embodiments, the EST-related
nucleic acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic
acids, or fragments of positional segments of nucleic acids may include the full coding sequence for the
protein or, in the case of secreted proteins, the full coding sequence of the mature protein (i.e. the protein
generated when the signal peptide is cleaved off). In addition, the EST-related nucleic acids,
fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments 
of positional segments of nucleic acids may include regulatory regions upstream of the translation start 
site or downstream of the stop codon which control the amount, location, or developmental stage of gene
expression.

As discussed above, both secreted and non-secreted human proteins may be therapeutically 
important. Thus, the proteins expressed from the EST-related nucleic acids, fragments of EST-related 
nucleic acids, positional segments of EST-related nucleic acids, or fragments of positional segments of
nucleic acids may be useful in treating or controlling a variety of human conditions. 

The EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of 
EST-related nucleic acids, or fragments of positional segments of nucleic acids may be used in forensic 
procedures to identify individuals or in diagnostic procedures to identify individuals having genetic 
diseases resulting from abnormal gene expression. In addition, the EST-related nucleic acids, fragments 
of EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments of 
positional segments of nucleic acids are useful for constructing a high resolution map of the human 
chromosomes. 

The present invention also relates to secretion vectors capable of directing the secretion of a 
protein of interest. Such vectors may be used in gene therapy strategies in which it is desired to produce 
a gene product in one cell which is to be delivered to another location in the body. Secretion vectors 
may also facilitate the purification of desired proteins. 

The present invention also relates to expression vectors capable of directing the expression of an 
inserted gene in a desired spatial or temporal manner or at a desired level. Such vectors may include 
sequences upstream of the EST-related nucleic acids, fragments of EST-related nucleic acids, positional 
segments of EST-related nucleic acids, or fragments of positional segments of nucleic acids, such as 
promoters or upstream regulatory sequences. 

The present invention also comprises fusion vectors for making chimeric polypeptides 
comprising a first polypeptide and a second polypeptide. Such vectors are useful for determining the 
cellular localization of the chimeric polypeptides or for isolating, purifying or enriching the chimeric 
polypeptides. 

The EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of 
EST-related nucleic acids, or fragments of positional segments of nucleic acids may also be used for 
gene therapy to control or treat genetic diseases. In the case of secreted proteins, signal peptides may be 
fused to heterologous proteins to direct their extracellular secretion. 

Bacterial clones containing Bluescript plasmids having inserts containing the sequence of the 
non-aligned 5' ESTs, also referred to as singletons, and sequences of the 5' ESTs which were aligned to 
yield consensus contigated 5' ESTs are presently stored at 80°C in 4% (v/v) glycerol in the inventor's 
laboratories under internal designations. The non-aligned 5' ESTs are those which comprise a single 
EST from a single tissue in the listing of Table V. The inserts may be recovered from the stored 
materials by growing the appropriate clones on a suitable medium. The Bluescript DNA can then be 
isolated using plasmid isolation procedures familiar to those skilled in the art such as alkaline lysis 
minipreps or large scale alkaline lysis plasmid isolation procedures. If desired, the plasmid DNA may be 
further enriched by centrifugation on a cesium chloride gradient, size exclusion chromatography, or 
anion exchange chromatography. The plasmid DNA obtained using these procedures may then be 
manipulated using standard cloning techniques familiar to those skilled in the art. Alternatively, a PCR 
can be performed with primers designed at both ends of the inserted EST-related nucleic acids, 
fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments 
of positional segments of nucleic acids. The PCR product which corresponds to the EST-related nucleic 
acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or 
fragments of positional segments of nucleic acids can then be manipulated using standard cloning 
techniques familiar to those skilled in the art. 
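
As an illustration of designing primers at both ends of an insert, the following minimal sketch picks a forward primer from the first bases of the insert and a reverse primer as the reverse complement of its last bases. The function names, the default primer length, and the example sequence are assumptions made for illustration only; a practical design would also consider melting temperature, secondary structure, and flanking vector sequence.

```python
# Hypothetical helper names; the primer length and the example insert are
# illustrative assumptions, not values taken from the text.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def end_primers(insert: str, length: int = 20) -> tuple[str, str]:
    """Pick one primer at each end of an insert.

    The forward primer copies the first `length` bases of the insert; the
    reverse primer is the reverse complement of the last `length` bases,
    so that both primers read 5' to 3' towards the middle of the insert.
    """
    if length <= 0 or length > len(insert):
        raise ValueError("primer length must be between 1 and the insert length")
    forward = insert[:length]
    reverse = reverse_complement(insert[-length:])
    return forward, reverse


if __name__ == "__main__":
    fwd, rev = end_primers("ATGCGTACGTTAGC", length=4)
    print(fwd, rev)
```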

One embodiment of the present invention is a purified nucleic acid comprising a sequence 
selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and 
sequences complementary to the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622. 

Another embodiment of the present invention is a purified nucleic acid comprising at least 
10, 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 50, 75, 100, 200, 300, 500, or 1000 consecutive nucleotides, to 
the extent that fragments of these lengths are consistent with the specific sequence, of a sequence 
selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and 
sequences complementary to the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622. 
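
The qualifier "to the extent that fragments of these lengths are consistent with the specific sequence" can be read as meaning that a fragment cannot be longer than the sequence it is taken from. The sketch below, under that reading, lists which of the recited lengths are possible for a given sequence and enumerates the consecutive-nucleotide fragments of one length. The function names and the example sequences are hypothetical.

```python
# Fragment lengths recited in the embodiment above.
RECITED_LENGTHS = [10, 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 50,
                   75, 100, 200, 300, 500, 1000]


def feasible_lengths(seq, lengths=RECITED_LENGTHS):
    """Return the recited lengths that do not exceed the sequence length."""
    return [n for n in lengths if n <= len(seq)]


def consecutive_fragments(seq, n):
    """Return every run of n consecutive nucleotides in seq, in order.

    The result is empty when n is larger than the sequence.
    """
    if n <= 0:
        raise ValueError("fragment length must be positive")
    return [seq[i:i + n] for i in range(len(seq) - n + 1)]


if __name__ == "__main__":
    example = "ACGTACGTACGTACGTACGTAC"  # 22 nt, illustrative only
    print(feasible_lengths(example))
    print(len(consecutive_fragments(example, 20)))
```
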

A further embodiment of the present invention is a purified nucleic acid comprising the 
coding sequence of a sequence selected from the group consisting of SEQ ID NOs. 24-811. 

Yet another embodiment of the present invention is a purified nucleic acid comprising the 
full coding sequence of a sequence selected from the group consisting of SEQ ID NOs. 766-792 
wherein the full coding sequence comprises the sequence encoding the signal peptide and the 
sequence encoding the mature protein. 

Still another embodiment of the present invention is a purified nucleic acid comprising a 
contiguous span of a sequence selected from the group consisting of SEQ ID NOs. 766-792 which 
encodes the mature protein. 

Another embodiment of the present invention is a purified nucleic acid comprising a 
contiguous span of a sequence selected from the group consisting of SEQ ID NOs. 24-728 and 766-792 
which encodes the signal peptide. 

Another embodiment of the present invention is a purified nucleic acid encoding a 
polypeptide comprising a sequence selected from the group consisting of the sequences of SEQ ID 
NOs. 812-1599. 

Another embodiment of the present invention is a purified nucleic acid encoding a 
polypeptide comprising a sequence selected from the group consisting of the sequences of SEQ ID 
NOs. 1554-1580. 

Another embodiment of the present invention is a purified nucleic acid encoding a 
polypeptide comprising a mature protein included in a sequence selected from the group consisting of 
the sequences of SEQ ID NOs. 1554-1580. 

Another embodiment of the present invention is a purified nucleic acid encoding a 
polypeptide comprising a signal peptide included in a sequence selected from the group consisting of 
the sequences of SEQ ID NOs. 812-1516 and 1554-1580. 

Another embodiment of the present invention is a purified nucleic acid at least 30, 35, 40, 50, 
75, 100, 200, 300, 500 or 1000 nucleotides in length which hybridizes under stringent conditions to a 
sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 
and sequences complementary to the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 
1600-1622. 

Another embodiment of the present invention is a purified or isolated polypeptide comprising 
a sequence selected from the group consisting of the sequences of SEQ ID NOs. 812-1599. 

Another embodiment of the present invention is a purified or isolated polypeptide comprising 
a sequence selected from the group consisting of SEQ ID NOs. 1554-1580. 

Another embodiment of the present invention is a purified or isolated polypeptide comprising 
a mature protein of a polypeptide selected from the group consisting of SEQ ID NOs. 1554-1580. 

Another embodiment of the present invention is a purified or isolated polypeptide comprising 
a signal peptide of a sequence selected from the group consisting of the polypeptides of SEQ ID 
NOs. 812-1516 and 1554-1580. 

Another embodiment of the present invention is a purified or isolated polypeptide comprising 
at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 50, 75, 100, 200, 300, 500, or 1000 consecutive amino 
acids, to the extent that fragments of these lengths are consistent with the specific sequence, of a 
sequence selected from the group consisting of the sequences of SEQ ID NOs. 812-1599. 

Another embodiment of the present invention is a method of making a cDNA comprising the 
steps of contacting a collection of mRNA molecules from human cells with a primer comprising at 
least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from 
the group consisting of the sequences complementary to SEQ ID NOs. 24-811 and SEQ ID NOs. 
1600-1622, hybridizing said primer to an mRNA in said collection that encodes said protein, reverse 
transcribing said hybridized primer to make a first cDNA strand from said mRNA, making a second 
cDNA strand complementary to said first cDNA strand, and isolating the resulting cDNA encoding 
said protein comprising said first cDNA strand and said second cDNA strand. 

Another embodiment of the present invention is a purified cDNA obtainable by the method 
of the preceding paragraph. 

In one aspect of this embodiment, the cDNA encodes at least a portion of a human 
polypeptide. 

Another embodiment of the present invention is a method of making a cDNA comprising the 
steps of obtaining a cDNA comprising a sequence selected from the group consisting of SEQ ID 
NOs. 24-811 and SEQ ID NOs. 1600-1622, contacting said cDNA with a detectable probe 
comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence 
selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and the 
sequences complementary to SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 under conditions 
which permit said probe to hybridize to said cDNA, identifying a cDNA which hybridizes to said 
detectable probe, and isolating said cDNA which hybridizes to said probe. 

Another embodiment of the present invention is a purified cDNA obtainable by the method 
of the preceding paragraph. 

In one aspect of this embodiment, the cDNA encodes at least a portion of a human 
polypeptide. 

Another embodiment of the present invention is a method of making a cDNA comprising the 
steps of contacting a collection of mRNA molecules from human cells with a first primer capable of 
hybridizing to the polyA tail of said mRNA, hybridizing said first primer to said polyA tail, reverse 
transcribing said mRNA to make a first cDNA strand, making a second cDNA strand complementary 
to said first cDNA strand using at least one primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 
35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting of SEQ ID 
NOs. 24-811 and SEQ ID NOs. 1600-1622, and isolating the resulting cDNA comprising said first 
cDNA strand and said second cDNA strand. 

Another embodiment of the present invention is a purified cDNA obtainable by the method 
of the preceding paragraph. 

In one aspect of this embodiment, said cDNA encodes at least a portion of a human 
polypeptide. 

In another aspect of the preceding method, the second cDNA strand is made by contacting 
said first cDNA strand with a first pair of primers, said first pair of primers comprising a second 
primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a 
sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 
and a third primer having a sequence therein which is included within the sequence of said first 
primer, performing a first polymerase chain reaction with said first pair of primers to generate a first 
PCR product, contacting said first PCR product with a second pair of primers, said second pair of 
primers comprising a fourth primer, said fourth primer comprising at least 12, 15, 18, 20, 23, 25, 28, 
30, 35, 40, or 50 consecutive nucleotides of said sequence selected from the group consisting of SEQ 
ID NOs. 24-811 and SEQ ID NOs. 1600-1622, and a fifth primer, wherein said fourth and fifth 
primers hybridize to sequences within said first PCR product, and performing a second polymerase 
chain reaction, thereby generating a second PCR product. 

One aspect of this embodiment is a purified cDNA obtainable by the method of the 
preceding paragraph. 

In another aspect of this embodiment, said cDNA encodes at least a portion of a human 
polypeptide. 

Alternatively, the second cDNA strand may be made by contacting said first cDNA strand 
with a second primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive 
nucleotides of a sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID 
NOs. 1600-1622, hybridizing said second primer to said first strand cDNA, and extending said 
hybridized second primer to generate said second cDNA strand. 

One aspect of the above embodiment is a purified cDNA obtainable by the method of the 
preceding paragraph. 

In a further aspect of this embodiment said cDNA encodes at least a portion of a human 
polypeptide. 

Another embodiment of the present invention is a method of making a polypeptide 
comprising the steps of obtaining a cDNA which encodes a polypeptide encoded by a nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs. 24-811 or a cDNA which 
encodes a polypeptide comprising at least 6, 8, 10, 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 
consecutive amino acids of a polypeptide encoded by a sequence selected from the group consisting 
of SEQ ID NOs. 24-811, inserting said cDNA in an expression vector such that said cDNA is 
operably linked to a promoter, introducing said expression vector into a host cell whereby said host 
cell produces the protein encoded by said cDNA, and isolating said protein. 

Another aspect of this embodiment is an isolated protein obtainable by the method of the 
preceding paragraph. 

Another embodiment of the present invention is a method of obtaining a promoter DNA 
comprising the steps of obtaining genomic DNA located upstream of a nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 
and the sequences complementary to the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 
1600-1622, screening said genomic DNA to identify a promoter capable of directing transcription 
initiation, and isolating said DNA comprising said identified promoter. 

In one aspect of this embodiment, said obtaining step comprises walking from genomic DNA 
comprising a sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 
1600-1622 and the sequences complementary to SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622. 
In another aspect of this embodiment, said screening step comprises inserting genomic DNA located 
upstream of a sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID 
NOs. 1600-1622 and the sequences complementary to SEQ ID NOs. 24-811 and SEQ ID NOs. 
1600-1622 into a promoter reporter vector. For example, said screening step may comprise identifying 
motifs in genomic DNA located upstream of a sequence selected from the group consisting of SEQ 
ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and the sequences complementary to SEQ ID NOs. 
24-811 and SEQ ID NOs. 1600-1622 which are transcription factor binding sites or transcription start 
sites. 

Another embodiment of the present invention is an isolated promoter obtainable by the 
method of the paragraph above. 

Another embodiment of the present invention is the inclusion of at least one sequence 
selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622, the 
sequences complementary to the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 
and fragments comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 50, or 100 consecutive 
nucleotides of said sequence in an array of discrete ESTs or fragments thereof of at least 12, 15, 18, 
20, 23, 25, 28, 30, 35, 40, 50, or 100 nucleotides in length. In some aspects of this embodiment, the 
array includes at least two sequences selected from the group consisting of SEQ ID NOs. 24-811 and 
SEQ ID NOs. 1600-1622, the sequences complementary to the sequences of SEQ ID NOs. 24-811 
and SEQ ID NOs. 1600-1622, and fragments comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 
50, or 100 consecutive nucleotides of said sequences. In another aspect of this embodiment, the 
array includes at least one, three, five, ten, fifteen, or twenty sequences selected from the group 
consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622, the sequences complementary to 
the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and fragments comprising at 
least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 50, or 100 consecutive nucleotides of said sequences. 

Another embodiment of the present invention is an enriched population of recombinant 
nucleic acids, said recombinant nucleic acids comprising an insert nucleic acid and a backbone 
nucleic acid, wherein at least 0.01%, 0.05%, 0.1%, 0.5%, 1%, 2%, 5%, 10%, or 20% of said insert 
nucleic acids in said population comprise a sequence selected from the group consisting of SEQ ID 
NOs. 24-811 and SEQ ID NOs. 1600-1622 and the sequences complementary to SEQ ID NOs. 
24-811 and SEQ ID NOs. 1600-1622. 

Another embodiment of the present invention is a purified or isolated antibody capable of 
specifically binding to a polypeptide comprising a sequence selected from the group consisting of 
SEQ ID NOs. 812-1599. 

Another embodiment of the present invention is a purified or isolated antibody capable of 
specifically binding to a polypeptide comprising at least 6, 8, 10, 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, 
or 50 consecutive amino acids of a sequence selected from the group consisting of SEQ ID NOs. 812- 
1599. 

Yet another embodiment of the present invention is an antibody composition capable of 
selectively binding to an epitope-containing fragment of a polypeptide comprising a contiguous span 
of at least 8, 10, 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 amino acids of any of SEQ ID NOs. 
812-1599, wherein said antibody is polyclonal or monoclonal. 

Another embodiment of the present invention is a computer readable medium having stored 
thereon a sequence selected from the group consisting of a nucleic acid code of SEQ ID NOs. 24-811 
and 1600-1622 and a polypeptide code of SEQ ID NOs. 812-1599. 

Another embodiment of the present invention is a computer system comprising a processor 
and a data storage device wherein said data storage device has stored thereon a sequence selected 
from the group consisting of a nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622 and a 
polypeptide code of SEQ ID NOs. 812-1599. In one aspect of this embodiment, the computer system 
further comprises a sequence comparer and a data storage device having reference sequences stored 
thereon. For example, the sequence comparer may comprise a computer program which indicates 
polymorphisms. In another aspect of this embodiment, the computer system further comprises an 
identifier which identifies features in said sequence. 

Another embodiment of the present invention is a method for comparing a first sequence to a 
reference sequence wherein said first sequence is selected from the group consisting of a nucleic acid 
code of SEQ ID NOs. 24-811 and 1600-1622 and a polypeptide code of SEQ ID NOs. 812-1599 
comprising the steps of reading said first sequence and said reference sequence through use of a 
computer program which compares sequences and determining differences between said first sequence 
and said reference sequence with said computer program. In some aspects of this embodiment, said step 
of determining differences between the first sequence and the reference sequence comprises 
identifying polymorphisms. 
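
A minimal sketch of such a comparison is given below. It assumes the two sequences have already been aligned to the same length, which is a simplification: the text does not specify a comparison algorithm, and a practical sequence comparer would first perform an alignment. The function name and example sequences are hypothetical.

```python
def find_differences(first, reference):
    """Compare two pre-aligned sequences of equal length.

    Returns a list of (position, reference_residue, first_residue) tuples,
    with 1-based positions, for every position where the sequences differ.
    Each difference is a candidate polymorphism under the stated assumption
    that the sequences are already aligned.
    """
    if len(first) != len(reference):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (index + 1, ref_residue, first_residue)
        for index, (first_residue, ref_residue) in enumerate(zip(first, reference))
        if first_residue != ref_residue
    ]


if __name__ == "__main__":
    # Illustrative sequences only; not taken from the sequence listing.
    for position, ref_residue, first_residue in find_differences("ACGTAC", "ACCTAA"):
        print(f"position {position}: reference {ref_residue}, first {first_residue}")
```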

Another embodiment of the present invention is a method for identifying a feature in a 
sequence selected from the group consisting of a nucleic acid code of SEQ ID NOs. 24-811 and 
1600-1622 and a polypeptide code of SEQ ID NOs. 812-1599 comprising the steps of reading said 
sequence through the use of a computer program which identifies features in sequences and identifying 
features in said sequence with said computer program. 
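
The passage above does not say which features the program identifies. As one hedged example, the sketch below treats an open reading frame (an ATG start codon followed in frame by a stop codon) as the feature of interest and reports the first one found on the forward strand. The function name, the choice of feature, and the example sequence are assumptions for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_first_orf(seq):
    """Return (start, end) of the first ATG-initiated open reading frame.

    `start` is the 0-based index of the A of the ATG and `end` is the index
    just past the in-frame stop codon, so seq[start:end] is the ORF including
    its stop codon. Returns None when no ATG is followed by an in-frame stop.
    Only the forward strand is scanned.
    """
    seq = seq.upper()
    start = seq.find("ATG")
    while start != -1:
        # Walk codon by codon from this ATG looking for a stop codon.
        for pos in range(start, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3
        start = seq.find("ATG", start + 1)
    return None


if __name__ == "__main__":
    print(find_first_orf("CCATGAAATAGGG"))
```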

Another embodiment of the present invention is a vector comprising a nucleic acid according 
to any one of the nucleic acids described above. 

Another embodiment of the present invention is a host cell containing the above vector. 

Another embodiment of the present invention is a method of making any of the nucleic acids 
described above comprising the steps of introducing said nucleic acid into a host cell such that said 
nucleic acid is present in multiple copies in each host cell and isolating said nucleic acid from said host 
cell. 

Another embodiment of the present invention is a method of making a nucleic acid of any of 
the nucleic acids described above comprising the step of sequentially linking together the nucleotides 
in said nucleic acids. 

Another embodiment of the present invention is a method of making any of the polypeptides 
described above wherein said polypeptide is 150 amino acids in length or less comprising the step of 
sequentially linking together the amino acids in said polypeptide. 

Another embodiment of the present invention is a method of making any of the polypeptides 
described above wherein said polypeptide is 120 amino acids in length or less comprising the step of 
sequentially linking together the amino acids in said polypeptide. 



WO 99/53051                                                                PCT/IB99/00712 



Brief Description of the Drawings 
Figure 1 is a summary of a procedure for obtaining cDNAs which have been selected to include 
the 5' ends of the mRNAs from which they derived. In the first step (1), the cap of intact mRNAs is 
oxidized to be chemically ligated to an oligonucleotide tag. In the second step (2), a reverse transcription 
is performed using random primers to generate a first cDNA strand. In the third step (3), mRNAs are 
eliminated and the second strand synthesis is carried out using a primer contained in the oligonucleotide 
tag. 

Figure 2 is an analysis of the 43 amino terminal amino acids of all human SwissProt proteins to 
determine the frequency of false positives and false negatives using the techniques for signal peptide 
identification described herein. 

Figure 3 summarizes a general method used to clone and sequence extended cDNAs containing 
sequences adjacent to 5' ESTs. 

Figure 4 provides a schematic description of the promoters isolated and the way they are 
assembled with the corresponding 5' tags. 

Figure 5 describes the transcription factor binding sites present in each of the promoters of 
Figure 4. 

Figure 6 is a block diagram of an exemplary computer system. 

Figure 7 is a flow diagram illustrating one embodiment of a process 200 for comparing a new 
nucleotide or protein sequence with a database of sequences in order to determine the homology levels 
between the new sequence and the sequences in the database. 

Figure 8 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. 

Figure 9 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. 

Figure 10 is a table with all of the parameters that can be used for each step of extended cDNA 
analysis. 

Detailed Description of the Preferred Embodiment 
1. Obtaining 5' ESTs from cDNA Libraries Including the 5' Ends of their Corresponding mRNAs 
The 5' ESTs of the present invention were obtained from cDNA libraries including cDNAs 
which include the 5'end of their corresponding mRNAs. The general method used to obtain such cDNA 
libraries is described in Examples 1 to 5. 

EXAMPLE 1 

Preparation of mRNA 

Total human RNAs or polyA+ RNAs derived from 29 different tissues were respectively 
purchased from LABIMO and CLONTECH and used to generate 44 cDNA libraries as described below. 
The purchased RNA had been isolated from cells or tissues using acid guanidinium 
thiocyanate-phenol-chloroform extraction (Chomczynski and Sacchi, Analytical Biochemistry 
162:156-159, 1987). PolyA+ RNA was isolated from total RNA (LABIMO) by two passes of oligo dT 
chromatography, as described by Aviv and Leder (Proc. Natl. Acad. Sci. USA 69:1408-1412, 1972), 
in order to eliminate ribosomal RNA. 

The quality and the integrity of the polyA+ RNAs were checked. Northern blots hybridized 
with a globin probe were used to confirm that the mRNAs were not degraded. Contamination of the 
polyA+ mRNAs by ribosomal sequences was checked using Northern blots and a probe derived from 
the sequence of the 28S rRNA. Preparations of mRNAs with less than 5% of rRNAs were used in 
library construction. To avoid constructing libraries with RNAs contaminated by exogenous sequences 
(prokaryotic or fungal), the presence of bacterial 16S ribosomal sequences or of two highly expressed 
fungal mRNAs was examined using PCR. 

EXAMPLE 2 

Methods for Obtaining mRNAs having Intact 5' Ends 

Following preparation of the mRNAs from various tissues as described above, selection of 
mRNA with intact 5' ends and specific attachment of an oligonucleotide tag to the 5' end of such mRNA 
was performed using either a chemical or enzymatic approach. Both techniques take advantage of the 
presence of the "cap" structure, which characterizes the 5' end of intact mRNAs and which comprises a 
guanosine generally methylated once, at the 7 position. The chemical approach is illustrated in Figure 1. 

The chemical modification approach involves the optional elimination of the 2', 3'-cis diol of the 
3' terminal ribose, the oxidation of the 2', 3'-cis diol of the ribose linked to the cap of the 5' ends of the 
mRNAs into a dialdehyde, and the coupling of the dialdehyde thus obtained to a derivatized 
oligonucleotide tag. Further details regarding the chemical approaches for obtaining mRNAs having 
intact 5' ends are disclosed in International Application No. WO96/34981, published November 7, 1996. 

The enzymatic approach for ligating the oligonucleotide tag to the 5' ends of mRNAs with intact 
5' ends involves the removal of the phosphate groups present on the 5' ends of uncapped incomplete 
mRNAs, the subsequent decapping of mRNAs with intact 5' ends and the ligation of the phosphate 
present at the 5' end of the decapped mRNA to an oligonucleotide tag. Further details regarding the 
enzymatic approaches for obtaining mRNAs having intact 5' ends are disclosed in Dumas Milne 
Edwards J.B. (Doctoral Thesis of Paris VI University, Le clonage des ADNc complets: difficultés et 
perspectives nouvelles. Apports pour l'étude de la régulation de l'expression de la tryptophane 
hydroxylase de rat, 20 Dec. 1993), EPO 625572 and Kato et al., Gene 150:243-250 (1994). 

In either the chemical or the enzymatic approach, the oligonucleotide tag has a restriction 
enzyme site (e.g. an EcoRI site) therein to facilitate later cloning procedures. Following attachment of the 
oligonucleotide tag to the mRNA, the integrity of the mRNA was then examined by performing a 
Northern blot using a probe complementary to the oligonucleotide tag. 

EXAMPLE 3 

cDNA Synthesis Using mRNA Templates Having Intact 5' Ends 

For the mRNAs joined to oligonucleotide tags, first strand cDNA synthesis was performed using 
a reverse transcriptase with random nonamers as primers. In order to protect internal EcoRI sites in the 
cDNA from digestion at later steps in the procedure, methylated dCTP was used for first strand 
synthesis. After removal of mRNA by an alkaline hydrolysis, the first strand of cDNA was precipitated 
using isopropanol in order to eliminate residual primers. 

The second strand of the cDNA was synthesized with a Klenow fragment using a primer 
corresponding to the 5' end of the ligated oligonucleotide. Methylated dCTP was also used for second 
strand synthesis in order to protect internal EcoRI sites in the cDNA from digestion during the cloning 
process. 

EXAMPLE 4 

Cloning of cDNAs derived from mRNA with intact 5' ends into BlueScript 

Following second strand synthesis, the ends of the cDNA were blunted with T4 DNA 
polymerase (Biolabs) and the cDNA was digested with EcoRI. Since methylated dCTP was used during 
cDNA synthesis, the EcoRI site present in the tag was the only hemi-methylated site, hence the only site 
susceptible to EcoRI digestion. The cDNA was then size fractionated using exclusion chromatography 
(AcA, Biosepra) and fractions corresponding to cDNAs of more than 150 bp were pooled and ethanol 
precipitated. The cDNA was directionally cloned into the SmaI and EcoRI ends of the phagemid 
pBlueScript vector (Stratagene). The ligation mixture was electroporated into bacteria and propagated 
under appropriate antibiotic selection. 

EXAMPLE 5 

Selection of Clones Having the Oligonucleotide Tag Attached Thereto 

Clones containing the oligonucleotide tag attached were then selected as follows. The plasmid 
DNAs containing 5' EST libraries made as described above were purified (Qiagen). A positive selection 
of the tagged clones was performed as follows. Briefly, in this selection procedure, the plasmid DNA 
was converted to single stranded DNA using gene II endonuclease of the phage F1 in combination with 
an exonuclease (Chang et al., Gene 127:95-8, 1993) such as exonuclease III or T7 gene 6 exonuclease. 
The resulting single stranded DNA was then purified using paramagnetic beads as described by Fry et 
al., Biotechniques 13:124-131, 1992. In this procedure, the single stranded DNA was hybridized with a 
biotinylated oligonucleotide having a sequence corresponding to the 3' end of the oligonucleotide tag. 
Clones including a sequence complementary to the biotinylated oligonucleotide were captured by 
incubation with streptavidin coated magnetic beads followed by magnetic selection. After capture of the 
positive clones, the plasmid DNA was released from the magnetic beads and converted into double 
stranded DNA using a DNA polymerase such as the ThermoSequenase obtained from Amersham 
Pharmacia Biotech. The double stranded DNA was then electroporated into bacteria. The percentage of 
positive clones having the 5' tag oligonucleotide was estimated using dot blot analysis to typically be 
between 90 and 98%. 

Following electroporation, the libraries were ordered in 384-microtiter plates (MTP). A copy of 
the MTP was stored for future needs. Then the libraries were transferred into 96 MTP and sequenced as 
described below. 

EXAMPLE 6 

Sequencing of Inserts in Selected Clones 

Plasmid inserts were first amplified by PCR on PE-9600 thermocyclers (Perkin-Elmer, Applied 
Biosystems Division, Foster City, CA), using standard SETA-A and SETA-B primers (Genset SA), 
AmpliTaqGold (Perkin-Elmer), dNTPs (Boehringer), buffer and cycling conditions as recommended by 
the Perkin-Elmer Corporation. 

PCR products were then sequenced using automatic ABI Prism 377 sequencers (Perkin Elmer). 
Sequencing reactions were performed using PE 9600 thermocyclers with standard dye-primer chemistry 
and ThermoSequenase (Amersham Pharmacia Biotech). The primers used were either T7 or 21M13 
(available from Genset SA) as appropriate. The primers were labeled with the JOE, FAM, ROX and 
TAMRA dyes. The dNTPs and ddNTPs used in the sequencing reactions were purchased from 
Boehringer. Sequencing buffer, reagent concentrations and cycling conditions were as recommended by 
Amersham. 

Following the sequencing reaction, the samples were precipitated with ethanol, resuspended in
formamide loading buffer, and loaded on a standard 4% acrylamide gel. Electrophoresis was performed
for 2.5 hours at 3000V on an ABI 377 sequencer, and the sequence data were collected and analyzed
using the ABI Prism DNA Sequencing Analysis Software, version 2.1.2.

EXAMPLE 7
Obtaining 5' ESTs from Extended cDNA libraries
Obtained from mRNA with Intact 5' Ends
Alternatively, 5'ESTs may be isolated from other cDNA or genomic DNA libraries. Such
cDNA or genomic DNA libraries may be obtained from a commercial source or made using other
techniques familiar to those skilled in the art. One example of such cDNA library construction, a
full-length cDNA library, is as follows.

PolyA+ RNAs are prepared and their quality checked as described in Example 1. Then, the
caps at the 5' ends of the polyA+ RNAs are specifically joined to an oligonucleotide tag as described in
Example 2. The oligonucleotide tag may contain a restriction site such as Eco RI to facilitate further
subcloning procedures. Northern blotting is then performed to check the size of mRNAs having the
oligonucleotide tag attached thereto and to ensure that the mRNAs are actually tagged.

First strand synthesis is subsequently carried out for mRNAs joined to the oligonucleotide tag as
described in Example 3 above except that the random nonamers are replaced by an oligo dT primer. For
instance, this oligo-dT primer may contain an internal tag of 4 nucleotides which is different from one
tissue to the other. Following second strand synthesis using a primer contained in the oligonucleotide
tag attached to the 5' end of mRNA, the blunt ends of the obtained double stranded full-length DNAs
are modified into cohesive ends to facilitate subcloning. For example, the extremities of full-length
cDNAs may be modified to allow subcloning into the Eco RI and Hind III sites of a Bluescript vector
using the Eco RI site of the oligonucleotide tag and the addition of a Hind III adaptor to the 3' end of
full-length cDNAs.

The full-length cDNAs are then separated into several fractions according to their sizes using
techniques familiar to those skilled in the art. For example, electrophoretic separation may be applied in
order to yield 3 or 6 different fractions. Following gel extraction and purification, the cDNA fractions
are subcloned into appropriate vectors, such as Bluescript vectors, transformed into competent bacteria
and propagated under appropriate antibiotic conditions. Subsequently, plasmids containing tagged
full-length cDNAs are positively selected as described in Example 5.

The 5' end of full-length cDNAs isolated from such cDNA libraries may then be sequenced as 
described in Example 6 to yield 5'ESTs. 

II. Computer Analysis of the Isolated 5' ESTs: Construction of the SignalTag™ Database 

The sequence data from the cDNA libraries made as described above were transferred to a
database, where quality control and validation steps were performed. A base-caller, running on a
Unix system, automatically flagged suspect peaks, taking into account the shape of the peaks, the
inter-peak resolution, and the noise level. The base-caller also performed an automatic trimming. Any
stretch of 25 or fewer bases having more than 4 suspect peaks was considered unreliable and was discarded.
Sequences corresponding to cloning vector or ligation oligonucleotides were automatically removed
from the EST sequences. However, the resulting EST sequences may contain 1 to 5 bases belonging to
the above mentioned sequences at their 5' end. If needed, these can easily be removed on a case-by-case
basis.
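The trimming rule above (any stretch of 25 or fewer bases holding more than 4 suspect peaks is unreliable) can be illustrated with a minimal Python sketch. The function name, the boolean per-base "suspect" flags, and the choice to mark the stretch from the first to the last of the offending peaks are assumptions made for illustration only; the actual base-caller is not described in further detail here.

```python
def unreliable_mask(suspect, max_len=25, max_suspect=4):
    """Return one boolean per base, True where the base lies in an unreliable stretch.

    `suspect` is a hypothetical list of booleans, one per called base,
    True where the base-caller flagged the peak as suspect.
    """
    positions = [i for i, flagged in enumerate(suspect) if flagged]
    mask = [False] * len(suspect)
    # A stretch becomes unreliable once it holds max_suspect + 1 suspect peaks.
    for j in range(len(positions) - max_suspect):
        start, end = positions[j], positions[j + max_suspect]
        if end - start + 1 <= max_len:
            for p in range(start, end + 1):
                mask[p] = True
    return mask
```

Five suspect peaks packed into nine bases mark those nine bases as unreliable, whereas five suspect peaks spread over 41 bases mark nothing.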

Following sequencing as described above, the sequences of the 5' ESTs were entered in a
database for storage and manipulation as described below. Before searching the ESTs in the database for
sequences of interest, ESTs derived from mRNAs which were not of interest were identified. Briefly,
such undesired sequences may be of three types. First, contaminants of either endogenous (ribosomal
RNAs, transfer RNAs, mitochondrial RNAs) or exogenous (prokaryotic RNAs and fungal RNAs)
origins were identified. Second, uninformative sequences, namely redundant sequences, small
sequences and highly degenerate sequences were identified. Third, repeated sequences (Alu, L1, THE
and MER repeats, SSTR sequences or satellite, micro-satellite, or telomeric repeats) were identified and
masked in further processing.

In order to determine the accuracy of the sequencing procedure as well as the efficiency of the
5' selection described above, the analyses described in Examples 8 and 9 respectively were performed
on 5'ESTs obtained from the database following the elimination of endogenous and exogenous
contaminants and following the masking of repeats.

EXAMPLE 8
Measurement of Sequencing Accuracy by Comparison to Known Sequences
To further determine the accuracy of the sequencing procedure described in Example 6, the
sequences of 5' ESTs derived from known sequences were identified and compared to the original
known sequences. First, a FASTA analysis with overhangs shorter than 5 bp on both ends was
conducted on the 5' ESTs to identify those matching an entry in the public human mRNA database
available at the time of filing the priority applications. The 5' ESTs which matched a known human
mRNA were then realigned with their cognate mRNA and dynamic programming was used to include
substitutions, insertions, and deletions in the list of "errors" which would be recognized. Errors
occurring in the last 10 bases of the 5' EST sequences were ignored to avoid the inclusion of spurious
cloning sites in the analysis of sequencing accuracy. This analysis revealed that the sequences
incorporated in the database had an accuracy of more than 99.5%.
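The error count described above can be sketched in Python as a dynamic-programming alignment of an EST against its cognate mRNA, counting substitutions, insertions, and deletions and ignoring the last 10 bases of the EST. This is only an illustrative sketch: the unit cost per error, the free choice of where the EST starts and ends within the mRNA, and the function names are assumptions, not the parameters actually used.

```python
def count_errors(est, mrna, ignore_tail=10):
    """Return (errors, bases_compared) for an EST aligned to its cognate mRNA."""
    # Drop the last bases of the EST, where spurious cloning sites may occur.
    query = est[:-ignore_tail] if ignore_tail else est
    # Row 0 is all zeros: the EST may start anywhere in the mRNA at no cost.
    prev = [0] * (len(mrna) + 1)
    for i, q in enumerate(query, 1):
        cur = [i] + [0] * len(mrna)
        for j, r in enumerate(mrna, 1):
            cur[j] = min(prev[j - 1] + (q != r),  # match or substitution
                         prev[j] + 1,             # extra base in the EST
                         cur[j - 1] + 1)          # base missing from the EST
        prev = cur
    # The EST may also end anywhere in the mRNA.
    return min(prev), len(query)


def accuracy(est, mrna, ignore_tail=10):
    """Fraction of compared bases that are error free, or None if nothing was compared."""
    errors, compared = count_errors(est, mrna, ignore_tail)
    return 1.0 - errors / compared if compared else None
```

Summing the error counts and compared lengths over all ESTs that match a known mRNA would give an overall accuracy comparable in kind to the figure quoted above.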

EXAMPLE 9
Determination of Efficiency of 5' EST Selection
To determine the efficiency at which the above selection procedures isolated 5' ESTs which
included sequences close to the 5' end of the mRNAs from which they derived, the sequences of the
ends of the 5' ESTs derived from the elongation factor 1 subunit α and ferritin heavy chain genes were
compared to the known cDNA sequences of these genes. Since the transcription start sites of both genes
are well characterized, they may be used to determine the percentage of derived 5' ESTs which included
the authentic transcription start sites. For both genes, more than 95% of the obtained 5' ESTs actually
included sequences close to or upstream of the 5' end of the corresponding mRNAs.

To extend the analysis of the reliability of the procedures for isolating 5' ESTs from ESTs in the
database, a similar analysis was conducted using a database composed of human mRNA sequences
extracted from GenBank database release 97 for comparison. The 5' ends of more than 85% of 5' ESTs
derived from mRNAs included in the GenBank database were located close to the 5' ends of the known
sequence. As some of the mRNA sequences available in the GenBank database are deduced from
genomic sequences, a 5' end matching with these sequences will be counted as an internal match. Thus,
the method used here underestimates the yield of ESTs including the authentic 5' ends of their
corresponding mRNAs.

EXAMPLE 10
Calculation of Novelty Indices for 5'EST Libraries
In order to evaluate the novelty of 5'EST libraries, the following analysis was performed. For
each sequenced 5'EST library, the sequences were clustered by the 5' end. Each sequence in the library
was compared to the others and the longest sequence found in the cluster was used as representative of
the group. A novelty rate (NR) was then defined as: NR = 100 × (Number of new unique sequences
found in the library / Total number of sequences from the library). Typically, the novelty rate ranged
between 10% and 41% depending on the tissue from which the 5'EST library was obtained. For most of
the libraries, the random sequencing of 5' EST libraries was pursued until the novelty rate reached 20%.
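As a small worked illustration of the novelty rate defined above, the following Python sketch computes NR from two counts. The function name and the example counts are hypothetical and are not taken from any actual library.

```python
def novelty_rate(new_unique_sequences, total_sequences):
    """NR = 100 x (new unique sequences in the library / total sequences from the library)."""
    if total_sequences <= 0:
        raise ValueError("the library must contain at least one sequence")
    return 100.0 * new_unique_sequences / total_sequences


# Hypothetical library: 1,000 sequenced 5' ESTs, 200 of them new and unique.
print(novelty_rate(200, 1000))  # 20.0
```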

EXAMPLE 11
Generation of Consensus Contigated 5' ESTs
Since the cDNA libraries made above include multiple 5' ESTs derived from the same mRNA,
overlapping 5'ESTs may be assembled into continuous sequences. The following method describes how
to efficiently align multiple 5'ESTs in order to yield not only consensus contigated 5'EST sequences for
mRNAs derived from different genes but also consensus contigated 5'EST sequences for different
mRNAs, so called variants, transcribed from the same gene such as alternatively spliced mRNAs.

The whole set of sequences was first partitioned into small clusters containing sequences
which exhibited perfect matches with each other on a given length and which derived from a small
number of different genes. Some 5'EST sequences, so called singletons, were not aligned using this
approach because they were not homologous to any other sequence.

Thereafter, all variants of a given gene were identified in each cluster using proprietary
software. 5'EST sequences belonging to the same variant were then contigated and consensus
contigated 5'EST sequences generated for each variant. All consensus contigated 5' EST sequences
were subsequently compared to the whole set of individual 5'EST sequences used to obtain them.

If desired, the consensus contigated 5'EST sequences may be verified by identifying clones
in nucleic acid samples derived from biological tissues, such as cDNA libraries, which hybridize to
the probes based on the sequences of the consensus contigated 5'ESTs using any methods described
herein and sequencing those clones.

Application of this alignment method to a selected set of 5'ESTs free from endogenous
contaminants and uninformative sequences, and following the masking of repeats, yielded consensus
contigated 5'EST sequences or variants of clustered genes encompassing many individual 5'ESTs.
Both non aligned 5'ESTs, i.e. singletons, and consensus contigated 5'ESTs were then compared to
already known sequences and those sequences matching human mRNA sequences were eliminated
from further analysis.

EXAMPLE 12
Identification of Open Reading Frames in 5' ESTs
Subsequently, consensus contigated 5'ESTs and 5'ESTs were screened to identify those
having an open reading frame (ORF).
Such open reading frames were simply defined as uninterrupted nucleic acid sequences longer
than 45 nucleotides and beginning with an ATG codon.
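The simple ORF definition above can be sketched in Python as a scan of the three forward reading frames. The sketch assumes that "uninterrupted" means free of in-frame stop codons, that the length is counted from the ATG up to but excluding any stop codon, and that an ORF running off the 3' end of the sequence is reported as incomplete; the function name and the returned tuple layout are illustrative only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_len=45):
    """Return (start, end, complete) tuples, 0-based and end-exclusive, for ORFs longer than min_len."""
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None:
                if codon == "ATG":
                    start = i  # first ATG opens a candidate ORF
            elif codon in STOP_CODONS:
                if i - start > min_len:
                    orfs.append((start, i, True))  # start and stop codon both found
                start = None
        if start is not None:
            # No in-frame stop codon before the end of the sequence.
            end = start + ((len(seq) - start) // 3) * 3
            if end - start > min_len:
                orfs.append((start, end, False))
    return orfs
```

The `complete` flag distinguishes an ORF in which both a start codon and a stop codon were found from one in which only the start codon was found.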

Alternatively, the nucleic acid sequence was first divided into several subsequences whose
coding propensity was evaluated separately using one or several different methods known to those
skilled in the art such as the evaluation of N-mer frequency and its variants (Fickett and Tung,
Nucleic Acids Res. 20:6441-50 (1992)) or the Average Mutual Information method (Grosse et al.,
International Conference on Intelligent Systems for Molecular Biology, Montreal, Canada, June 28-
July 1, 1998). Each of the scores obtained by the techniques described above was then normalized
by its distribution extremities and then fused using a neural network into a unique score that
represents the coding probability of a given subsequence. The coding probability scores obtained for
each subsequence, and thus the probability score profiles obtained for each reading frame, were then
linked to the initiation codons present on the sequence. For each open reading frame, defined as a
nucleic acid sequence beginning with an ATG codon, an ORF score was determined. Preferably, this
score is the sum of the probability scores computed for each subsequence corresponding to the
considered ORF in the correct reading frame corrected by a function that negatively accounts for
locally high score values and positively accounts for sustained high score values. The most probable
ORF with the highest score was selected.

In some embodiments, nucleic acid sequences encoding an "incomplete ORF", as referred to
herein, namely an open reading frame in which a start codon has been identified but no stop codon has
been identified, were obtained.
In other embodiments, nucleic acid sequences encoding a "complete ORF", as used herein,
namely an open reading frame in which a start codon and a stop codon have been identified, are
obtained.

In a preferred embodiment, open reading frames encoding polypeptides of at least 50 amino 
acids were obtained. 

To confirm that the chosen ORF actually encodes a polypeptide, the consensus contigated
5'EST or 5'EST may be used to obtain an extended cDNA using any of the techniques described herein,
and especially those described in Examples 19 and 20. Then, such obtained extended cDNAs may be
screened for the most probable open reading frame using any of the techniques described herein. The
amino acid sequence of the ORF encoded by the consensus contigated 5'EST or 5'EST may then be
compared to the amino acid sequence of the ORF encoded by the extended cDNA using any of the
algorithms and parameters described herein in order to determine whether the ORF encoded by the
extended cDNA is basically the same as the one encoded by the consensus contigated 5'EST or 5'EST.

Alternatively, to confirm that the chosen ORF actually encodes a polypeptide, the consensus
contigated 5'EST or 5'EST may be used to obtain an extended cDNA using any of the techniques
described herein, and especially those described in Examples 19 and 20. Such an extended cDNA may
then be inserted into an appropriate expression vector and used to express the polypeptide encoded by
the extended cDNA as described herein. The expressed polypeptide may be isolated, purified, or
enriched as described herein. Several methods known to those skilled in the art may then be used to
determine whether the expressed polypeptide is the one actually encoded by the chosen ORF, herein
referred to as the expected polypeptide. Such methods are based on the determination of predictable
features of the expressed polypeptide, including but not limited to its amino acid sequence, its size or its
charge, and the comparison of these features to those predicted for the expected polypeptide. The
following paragraphs present examples of such methods.

One of these methods consists in the determination of at least a portion of the amino acid
sequence of the expressed polypeptide using any technique known to those skilled in the art. For
example, the amino-terminal residues may be determined using techniques either based on Sanger's
technique of acid hydrolysis of a polypeptide whose N-terminal residue has been covalently labeled or
using techniques based on Edman degradation of polypeptides whose N-terminal residues are
sequentially labeled and cleaved from the polypeptide of interest. The amino acid sequence of the
expressed polypeptide may then be compared to the one predicted for the expected polypeptide using
any algorithm and parameters described herein.

Alternatively, the size of the expressed polypeptides may be determined using techniques
familiar to those skilled in the art such as Coomassie blue or silver staining and subsequently compared
to the size predicted for the expected polypeptide. Generally, the band corresponding to the expressed
polypeptide will have a mobility near that expected based on the number of amino acids in the open
reading frame of the extended cDNA. However, the band may have a mobility different than that
expected as a result of modifications such as glycosylation, ubiquitination, or enzymatic cleavage.

Alternatively, specific antibodies or antipeptides may be generated against the expected
polypeptide as described in Example 34 and used to perform immunoblotting or immunoprecipitation
studies against the expressed polypeptide. The presence of a band in samples from cells containing the
expression vector with the extended cDNA which is absent in samples from cells containing the
expression vector encoding an irrelevant polypeptide indicates that the expected polypeptide or portion
thereof is being expressed. Generally, the band corresponding to the expressed polypeptide will have a
mobility near that expected based on the number of amino acids in the open reading frame of the
extended cDNA. However, the band may have a mobility different than that expected as a result of
modifications such as glycosylation, ubiquitination, or enzymatic cleavage.

EXAMPLE 13
Identification of Potential Signal Sequences in 5' ESTs
The 5'ESTs or consensus contigated 5'ESTs found to encode an ORF were then searched to
identify potential signal motifs using slight modifications of the procedures disclosed in Von Heijne,
Nucleic Acids Res. 14:4683-4690, 1986. Those sequences encoding a 15 amino acid long stretch with a
score of at least 3.5 in the Von Heijne signal peptide identification matrix were considered to possess a
signal sequence. Those nucleic acid sequences which match a known human mRNA or EST sequence
and have a 5' end located downstream of the known 5' end, preferably by more than 20 nucleotides, were
excluded from further analysis. The remaining nucleic acids having signal sequences therein were
included in a database called SignalTag™.
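The window-scoring step above can be illustrated with a short Python sketch that slides a 15 residue window along a translated ORF, sums position-specific weights, and applies the 3.5 threshold. The weight function shown is a deliberately crude placeholder (a fixed bonus for hydrophobic residues) and is NOT the Von Heijne matrix; the function names are likewise hypothetical. A real implementation would substitute the published position-specific weights.

```python
HYDROPHOBIC = set("AVLIFM")


def toy_weight(position, residue):
    """Placeholder weight, NOT the Von Heijne matrix: +0.5 for hydrophobic residues, else -0.5."""
    return 0.5 if residue in HYDROPHOBIC else -0.5


def best_signal_window(protein, weight=toy_weight, width=15, threshold=3.5):
    """Return (start, score) of the best-scoring window, or None if no window reaches the threshold."""
    best = None
    for start in range(len(protein) - width + 1):
        window = protein[start:start + width]
        score = sum(weight(k, aa) for k, aa in enumerate(window))
        if best is None or score > best[1]:
            best = (start, score)
    if best is not None and best[1] >= threshold:
        return best
    return None
```

Passing the weight function as a parameter keeps the scanning logic independent of the particular matrix, so the placeholder can be swapped for real weights without touching the loop.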

EXAMPLE 14
Confirmation of Accuracy of Identification of Potential Signal Sequences in 5' ESTs
The accuracy of the above procedure for identifying signal sequences encoding signal peptides
was evaluated by applying the method to the 43 amino acids located at the N terminus of all human
SwissProt proteins. The computed Von Heijne score for each protein was compared with the known
characterization of the protein as being a secreted protein or a non-secreted protein. In this manner, the
number of non-secreted proteins having a score higher than 3.5 (false positives) and the number of
secreted proteins having a score lower than 3.5 (false negatives) could be calculated.

Using the results of the above analysis, the probability that a peptide encoded by the 5' region of
the mRNA is in fact a genuine signal peptide based on its Von Heijne's score was calculated based on
either the assumption that 10% of human proteins are secreted or the assumption that 20% of human
proteins are secreted. The results of this analysis are shown in Figure 2.
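One way such a probability could be derived from the false positive and false negative counts is Bayes' rule, sketched below in Python. This is an assumption about the form of the calculation, not a statement of the procedure actually used to produce Figure 2, and the counts in the example are invented for illustration.

```python
def signal_peptide_posterior(true_pos, false_neg, false_pos, true_neg, prior):
    """P(genuine signal peptide | score above threshold), by Bayes' rule.

    true_pos / false_neg: secreted proteins scoring above / below the threshold.
    false_pos / true_neg: non-secreted proteins scoring above / below the threshold.
    prior: assumed fraction of human proteins that are secreted (e.g. 0.10 or 0.20).
    """
    sensitivity = true_pos / (true_pos + false_neg)
    false_positive_rate = false_pos / (false_pos + true_neg)
    hit = sensitivity * prior
    miss = false_positive_rate * (1.0 - prior)
    return hit / (hit + miss)


# Invented counts: 90 of 100 secreted and 50 of 1000 non-secreted proteins score above 3.5.
for prior in (0.10, 0.20):
    print(prior, round(signal_peptide_posterior(90, 10, 50, 950, prior), 3))
```

With these invented counts the posterior rises from about 0.667 under the 10% assumption to about 0.818 under the 20% assumption, which shows how strongly the result depends on the assumed fraction of secreted proteins.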

Using the above method of identification of secretory proteins, 5' ESTs of the following
polypeptides known to be secreted were obtained: human glucagon, gamma interferon induced
monokine precursor, secreted cyclophilin-like protein, human pleiotropin, and human biotinidase
precursor. Thus, the above method successfully identified those 5' ESTs which encode a signal peptide.

To confirm that the signal peptide encoded by the 5' ESTs or consensus contigated 5' ESTs
actually functions as a signal peptide, the signal sequences from the 5' ESTs or consensus contigated 5'
ESTs may be cloned into a vector designed for the identification of signal peptides. Such vectors are
designed to confer the ability to grow in selective medium only to host cells containing a vector with an
operably linked signal sequence. For example, to confirm that a 5' EST or consensus contigated 5' EST
encodes a genuine signal peptide, the signal sequence of the 5' EST or consensus contigated 5' EST may
be inserted upstream and in frame with a non-secreted form of the yeast invertase gene in signal peptide
selection vectors such as those described in U.S. Patent No. 5,536,637. Growth of host cells containing
signal sequence selection vectors with the correctly inserted 5' EST or consensus contigated 5' EST
signal sequence confirms that the 5' EST or consensus contigated 5' EST encodes a genuine signal
peptide.

Alternatively, the presence of a signal peptide may be confirmed by cloning the extended
cDNAs obtained using the ESTs or consensus contigated 5' ESTs into expression vectors such as pXT1
as described below, or by constructing promoter-signal sequence-reporter gene vectors which encode
fusion proteins between the signal peptide and an assayable reporter protein. After introduction of these
vectors into a suitable host cell, such as COS cells or NIH 3T3 cells, the growth medium may be
harvested and analyzed for the presence of the secreted protein. The medium from these cells is
compared to the medium from control cells containing vectors lacking the signal sequence or extended
cDNA insert to identify vectors which encode a functional signal peptide or an authentic secreted
protein.

EXAMPLE 15
Analysis of the Sequences of the Invention
The set of the nucleic acid sequences of the invention (SEQ ID NOs. 24-811 and 1600-1622)
was obtained as described in Example 11. Subsequently, the most probable open reading frame was
determined and signal sequences were searched, as described in Examples 12 and 13, for all sequences
of the invention.

The nucleotide sequences of SEQ ID NOs. 24-811 and 1600-1622 and the polypeptide
sequences encoded by SEQ ID NOs. 24-811 (i.e. polypeptide sequences of SEQ ID NOs. 812-1599)
are provided in the appended sequence listing, the structure of which is as follows.

SEQ ID NOs. 24-728 are nucleic acids having an incomplete ORF which encodes a signal
peptide. The locations of the incomplete ORFs and sequences encoding signal peptides are listed in the
accompanying Sequence Listing. In addition, the von Heijne score of the signal peptide computed as
described in Example 13 is listed as the "score" in the accompanying Sequence Listing. The sequence of
the signal-peptide is listed as "seq" in the accompanying Sequence Listing. The in the signal peptide
sequence indicates the location where proteolytic cleavage of the signal peptide occurs to generate a
mature protein.

SEQ ID NOs. 729-765 are nucleic acids having an incomplete ORF in which no sequence
encoding a signal peptide has been identified to date. However, it remains possible that subsequent
analysis will identify a sequence encoding a signal peptide in these nucleic acids. The locations of the
incomplete ORFs are listed in the accompanying Sequence Listing.

SEQ ID NOs. 766-792 are nucleic acids having a complete ORF which encodes a signal 
peptide. The locations of the complete ORFs and of the signal peptides, the von Heijne score of the 
signal peptide, the sequence of the signal-peptide and the proteolytic cleavage site are indicated as 
described above. 

SEQ ID NOs. 793-811 are nucleic acids having a complete ORF in which no sequence encoding
a signal peptide has been identified to date. However, it remains possible that subsequent analysis will
identify a sequence encoding a signal peptide in these nucleic acids. The locations of the complete ORFs
are listed in the accompanying Sequence Listing.

SEQ ID NOs. 812-1516 are "incomplete polypeptide sequences" which include a signal peptide.
"Incomplete polypeptide sequences" are polypeptide sequences encoded by nucleic acids in which a start
codon has been identified but no stop codon has been identified. These polypeptides are encoded by the
nucleic acids of SEQ ID NOs. 24-728. The location of the signal peptide, the von Heijne score of the
signal peptide, the sequence of the signal-peptide and the proteolytic cleavage site are indicated as
described above.

SEQ ID NOs. 1517-1553 are incomplete polypeptide sequences in which no signal peptide has
been identified to date. However, it remains possible that subsequent analysis will identify a signal
peptide in these polypeptides. These polypeptides are encoded by the nucleic acids of SEQ ID NOs.
729-765.

SEQ ID NOs. 1554-1580 are "complete polypeptide sequences" which include a signal peptide.
"Complete polypeptide sequences" are polypeptide sequences encoded by nucleic acids in which a start
codon and a stop codon have been identified. These polypeptides are encoded by the nucleic acids of
SEQ ID NOs. 766-792. The location of the signal peptide, the von Heijne score of the signal peptide, the
sequence of the signal-peptide and the proteolytic cleavage site are indicated as described above.

SEQ ID NOs. 1581-1599 are complete polypeptide sequences in which no signal peptide has
been identified to date. However, it remains possible that subsequent analysis will identify a signal
peptide in these polypeptides. These polypeptides are encoded by the nucleic acids of SEQ ID NOs.
793-811.

SEQ ID NOs. 1600-1622 are nucleic acid sequences in which no open reading frame has been
conclusively identified to date. However, it remains possible that subsequent analysis will identify an
open reading frame in these nucleic acids.

In the accompanying Sequence Listing, all instances of the symbol "n" in the nucleic acid
sequences mean that the nucleotide can be adenine, guanine, cytosine or thymine. In some instances the
polypeptide sequences in the Sequence Listing contain the symbol "Xaa." These "Xaa" symbols
indicate either (1) a residue which cannot be identified because of nucleotide sequence ambiguity or (2) a
stop codon in the determined sequence where applicants believe one should not exist (if the sequence
were determined more accurately). In some instances, several possible identities of the unknown amino
acids may be suggested by the genetic code.

In the case of secreted proteins, it should be noted that, in accordance with the regulations
governing Sequence Listings, in the appended Sequence Listing, the full protein (i.e. the protein
containing the signal peptide and the mature protein) extends from an amino acid residue having a
negative number through a positively numbered C-terminal amino acid residue. Thus, the first amino
acid of the mature protein resulting from cleavage of the signal peptide is designated as amino acid
number 1, and the first amino acid of the signal peptide is designated with the appropriate negative
number.
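The numbering convention above can be made concrete with a minimal Python sketch. It assumes that no residue is numbered 0, so that the last residue of the signal peptide is -1 and is immediately followed by residue 1 of the mature protein; the function name and the example peptide are hypothetical.

```python
def number_residues(protein, signal_length):
    """Pair each residue with its Sequence Listing number (signal peptide negative, mature protein from 1)."""
    numbered = []
    for index, residue in enumerate(protein):
        offset = index - signal_length
        # Signal peptide residues keep their negative offset; the mature
        # protein starts at 1 because no residue is numbered 0 (assumption).
        numbered.append((offset if offset < 0 else offset + 1, residue))
    return numbered


# Hypothetical 6-residue protein whose first 3 residues form the signal peptide.
print(number_residues("MKLAQD", 3))
# [(-3, 'M'), (-2, 'K'), (-1, 'L'), (1, 'A'), (2, 'Q'), (3, 'D')]
```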

If one of the nucleic acid sequences of SEQ ID NOs. 24-811 and 1600-1622 is suspected of
containing one or more incorrect or ambiguous nucleotides, the ambiguities can readily be resolved by
resequencing a fragment containing the nucleotides to be evaluated. If one or more incorrect or
ambiguous nucleotides are detected, the corrected sequences should be included in the clusters from
which the sequences were isolated, and used to compute other consensus contigated sequences on which
other ORFs would be identified. Nucleic acid fragments for resolving sequencing errors or ambiguities
may be obtained from deposited clones or can be isolated using the techniques described herein.
Resolution of any such ambiguities or errors may be facilitated by using primers which hybridize to
sequences located close to the ambiguous or erroneous sequences. For example, the primers may
hybridize to sequences within 50-75 bases of the ambiguity or error. Upon resolution of an error or
ambiguity, the corresponding corrections can be made in the protein sequences encoded by the DNA
containing the error or ambiguity. The amino acid sequence of the protein encoded by a particular clone
can also be determined by expression of the clone in a suitable host cell, collecting the protein, and
determining its sequence.

In addition, if one of the sequences of SEQ ID NOs. 812-1599 is suspected of containing a
truncated ORF as the result of a frameshift in the sequence, such frameshifting errors may be corrected
by combining the following two approaches. The first one involves thorough examination of all
double predictions, i.e. all cases where the probability scores for two ORFs located on different
reading frames are high and close, preferably different by less than 0.4. Close examination of the
region where the two possible ORFs overlap may help to detect the frameshift. In the second
approach, homologies with known proteins are used to correct suspected frameshifts.

Of the identified clusters, some were shown to be multivariant, i.e. to contain several variants of
the same gene. Table I gives for each of the multivariant clusters named by its internal reference (first
column), the list of all variant consensus contigated 5'ESTs (second column), each being represented by
a different sequence identification number.



TABLE I

Cluster Internal Reference    SEQ ID NOs of Variants
C1                            687, 791
C2                            744, 798
C3                            640, 811
C4                            59, 66
C5                            84, 97
C6                            287, 289
C7                            286, 775, 777
C8                            762, 768
C9                            783, 784
C10                           80, 1603
C11                           655, 736
C12                           805, 806



Table II provides a list of preferred polynucleotide fragments which are derivatives of the
consensus contigated 5'ESTs. As used herein the term "polynucleotide described in Table II" refers to
all of the preferred polynucleotide fragments defined in Table II in the following manner. The
fragments are referred to by their SEQ ID numbers in the first column. The preferred polynucleotide
fragments are then defined by a range of nucleotide positions from the SEQ IDs of the consensus
contigated 5'ESTs as indicated in the second column entitled "positions of preferred fragments." The
preferred polynucleotide fragments correspond to the individual 5'ESTs aligned to obtain the consensus
contigated 5'EST and to those filed in the priority documents. The third column entitled "variant
nucleotides" describes the nucleotide sequence variations observed between the consensus contigated
5'EST and preferred nucleic acid fragments as follows:

A) Substitutions in the sequence of a consensus contigated 5'EST to derive a preferred
polynucleotide fragment are denoted by an "S", followed by a number indicating the first
nucleotide position in a specific SEQ ID to be substituted in a string of substituted nucleotides or
the position of the substituted nucleotide in the case of a single substituted nucleotide. Then
there is a comma followed by one or more lower case letters indicating the identity of the
nucleotide(s) occurring in the substituted position(s). For example, SEQ ID NO: 3401; Position
of preferred fragments: 1-250; Variant nucleotides: S45,atc would indicate that a preferred
polynucleotide fragment had the sequence of positions 1 to 250 of SEQ ID NO. 3401, except
that the nucleotides at positions 45, 46, and 47 were substituted with A, T, and C, respectively,
in the preferred polynucleotide as compared with the sequence of SEQ ID No. 3401.

B) Insertions in the sequence of a consensus contigated 5'EST to derive a preferred
polynucleotide fragment are denoted by an "I", followed by a number indicating the nucleotide
position in a specific SEQ ID after which a string of nucleotides is inserted or the position after
which the nucleotide is inserted in the case of a single inserted nucleotide. Then there is a comma
followed by one or more lower case letters indicating the identity of the nucleotide(s) occurring
in the inserted position(s). For example, SEQ ID NO: 7934; Position of preferred fragments: 1-500;
Variant nucleotides: I36,gataca would indicate that a preferred polynucleotide fragment
had the sequence of positions 1 to 500 of SEQ ID NO. 7934, except that after the nucleotide at
position 36 a GATACA string of nucleotides is inserted in the preferred polynucleotide as
compared with the sequence of SEQ ID NO. 7934.
C) Deletions in the sequence of a consensus contigated 5'EST to derive a preferred
nucleic acid fragment are denoted by a "D", followed by a number indicating the first
nucleotide position in a specific SEQ ID to be deleted in a string of deleted nucleotides or the
position of the deleted nucleotide in the case of a single deleted nucleotide. Then there is a
comma followed by a number indicating the number of nucleotide(s) deleted from the sequence
provided in the sequence ID. For example, SEQ ID NO: 5398; Position of preferred fragments:
56-780; Variant nucleotides: D114,5 would indicate that a preferred polynucleotide fragment had
the sequence of positions 56 to 780 of SEQ ID NO. 5398, except that the nucleotides in
positions 114 to 118 had been deleted in the preferred polynucleotide as compared with the
sequence of SEQ ID NO. 5398.
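
The S/I/D notation defined in items A) to C) can be illustrated with a short sketch. The function names `parse_variants` and `apply_variants` are hypothetical and are not part of the specification. The sketch assumes 1-based positions that always refer to the unmodified SEQ ID sequence, so edits are applied from the 3' end toward the 5' end. It does not apply the "positions of preferred fragments" range.

```python
import re
from typing import List, Tuple

# (kind, position, payload): kind is "S", "I" or "D"; payload is a string of
# lower case nucleotide letters for S and I, or a decimal count for D.
Variant = Tuple[str, int, str]

_ENTRY = re.compile(r"([SID])\s*(\d+)\s*,\s*([a-z]+|\d+)")


def parse_variants(field: str) -> List[Variant]:
    """Parse a 'variant nucleotides' cell such as 'S45, atc; D114, 5'."""
    variants = []
    for entry in field.split(";"):
        entry = entry.strip()
        if not entry:
            continue
        match = _ENTRY.fullmatch(entry)
        if match is None:
            raise ValueError(f"unrecognized variant entry: {entry!r}")
        kind, position, payload = match.group(1), int(match.group(2)), match.group(3)
        if (kind == "D") != payload.isdigit():
            raise ValueError(f"payload does not fit variant type: {entry!r}")
        variants.append((kind, position, payload))
    return variants


def _sort_key(variant: Variant) -> Tuple[int, int]:
    kind, position, _ = variant
    # 0-based offset where the edit starts; an insertion "after position p"
    # starts at offset p, a substitution or deletion "at p" starts at p - 1.
    offset = position if kind == "I" else position - 1
    # At equal offsets, substitutions and deletions are applied before insertions.
    return (offset, 0 if kind == "I" else 1)


def apply_variants(seq: str, variants: List[Variant]) -> str:
    """Apply variants right to left so original coordinates stay valid."""
    out = seq
    for kind, position, payload in sorted(variants, key=_sort_key, reverse=True):
        if kind == "S":
            out = out[:position - 1] + payload + out[position - 1 + len(payload):]
        elif kind == "I":
            out = out[:position] + payload + out[position:]
        else:  # "D": remove int(payload) nucleotides starting at position
            out = out[:position - 1] + out[position - 1 + int(payload):]
    return out
```

Under these assumptions, `apply_variants("a" * 113 + "ttttt" + "c" * 10, parse_variants("D114, 5"))` removes positions 114 to 118, mirroring the deletion example in item C).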

The present invention encompasses isolated, purified, or recombinant nucleic acids which consist of,
consist essentially of, or comprise a contiguous span of at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70,
80, 100, 250, or 500 nucleotides in length, to the extent that a contiguous span of these lengths is
consistent with the lengths of the particular polynucleotide, of a polynucleotide described in Table II,
or a sequence complementary thereto, wherein said polynucleotide described in Table II is selected
individually or in any combination from the polynucleotides described in Table II. The present
invention also encompasses isolated, purified, or recombinant nucleic acids which consist of or consist
essentially of a polynucleotide described in Table II, or a sequence complementary thereto, wherein said
polynucleotide is selected individually or in any combination from the polynucleotides described in
Table II. The present invention further encompasses isolated or purified polypeptides which consist of,
consist essentially of, or comprise a contiguous span of at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70,
80, or 100 amino acids encoded by a polynucleotide described in Table II.
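
The qualifier "to the extent that a contiguous span of these lengths is consistent with the lengths of the particular polynucleotide" can be read as a simple length check. The following is a minimal sketch of that reading; the helper names are hypothetical and the list of lengths is copied from the paragraph above.

```python
from typing import List

# Span lengths recited for nucleic acids in the paragraph above.
RECITED_SPAN_LENGTHS = (8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, 500)


def feasible_span_lengths(fragment_length: int) -> List[int]:
    """Return the recited span lengths that fit within a fragment of the given length."""
    return [n for n in RECITED_SPAN_LENGTHS if n <= fragment_length]


def contiguous_spans(seq: str, span_length: int) -> List[str]:
    """Enumerate every contiguous span of exactly span_length nucleotides in seq."""
    return [seq[i:i + span_length] for i in range(len(seq) - span_length + 1)]
```

For a preferred fragment covering positions 1-423, for example, `feasible_span_lengths(423)` keeps every recited length up to 250 and drops 500.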



Table II



SEQ ID NO.


Positions of Preferred Fragments


Variant nucleotides 


35 


1-423 


S124, s; I135, a; S293, w; D63, a; S377, r;
D424, 15 


41 


1-427 


I117, m; S120, r; S124, g; D373, 1; S376, b;
S378, b; I427, gggg; D428, 109


43 


1-276 


S114, m; S118, rg; S123, r; S139, nr; I142, t;
D148, 1; D152, 1; I228, t; I276, gg; D277, 136


45 


126-420 


D1, 125; I420, ggg; D421, 100


46 


1-255 


S139, r; 1145. S146, mm; S150, ar; S254, g; 

D256, 128 


48 


4-437 


D1, 3; S49, a; S55, g; S79, a; S90, a; I437,
tctctg 


59 


1-471 


S26, a; S44, t; S48, t; S109, a; S191, t; S200, 
gc; S203, a; S210, g; S237, a; S240, g; S255, a; 
S272, a; S277, a; S279, a; S284, t; S297, g; 
S305, g; S316, a; I471, ggtca


66 


1-428 


I428, tactgggg
82


1-399 


S251, t; S277, d; I399, aagccggg


84 


5-488 


D1, 4; S210, g; S293, a; S325, g; S339, a;
S348, g; S353, g; S395, g; I488, caeca


93 


1-508 


I508, gattt


96 


26-315 


D1, 25; S28, a; S62, c; I315, cagatgg


97 


4-460 


D1, 3; S19, g; S31, g; S114, gt; S118, a; S123,
to; S127, c; S132, a; S186, g; S190, c; S203, t;
S210, g; S232, c; I460, acgtt


105 


1-281 


S273, a; I281, g; D282, 211


114 


10-315 


I0, t; D1, 9; S91, m; S267, n; S276, w; S292, h;
S295, m; I315, tggg; D316, 19


118 


1-145 


S57, d; S126, d; I145, ccctc


120 


2-348 


D1, 1; S104, t; I348, g; D349, 38


121 


1-190 


I121, c; I190, ccctt


123 


1-353 


I117, m; I186, w; S187, y; I353, caccgggg


124 


1-249 


I249, ggrvgggg


125 


114-375


D1, 113; S206, wn; I231, a; I375, ccctagg


126 


1-437 


S297, cc; S307, tg; S312, a; S318, g; S341, a;
S351, t; S353, g; S383, c; S387, a; D404, 1


136 


82-428 


D1, 81; I428, aaagtg


139 


1-268 


I268, gggaaggg


148 


6-405 


D1, 5; I405, ggtgt


159 


1-230 


S227, ta; I230, ccctggg


165 


3-256 


I0, tat; D1, 2; I17, c; S18, t; S111, d; I115, t;
S123, r; I256, aaggcggg


170 


1-280 


I103, t; S104, c; I111, t; I280, cgttcggg


194 


1-215 


S50, s; S186, sn; S199, k; I215, gcagcggg


213 


1-158 


S128, m; I132, w; S143, d; I158, tgcccggg


223 


3-431 


D1, 2; S28, s; S79, c; S82, s; S308, nr; S328,
nb; I431, ccggc


247 


1-359 


I76, gttt; I359, tccctgg


258 


1-236 


S72, r; S81, g; S197, s; I205, ss; S232, k; I236,

acttcggg 


264 


5-283 


D1, 4; S64, g; S122, m; S134, yy; I137, c;
I151, t; I283, gttgc


269 


1-143 


S111, s; I143, ggggcggg


286 


5-207 


D1, 4; S204, a; S206, c; I207, gg; D208, 567


287 


1-277 


S114, r; I125, t; S131, ag; S256, tg; S259, tt;
S262, at; S267, t; S269, c; S273, c; I277,
ccggg; D278, 337


289 


69-416 


D1, 68; I416, agccaggg


289 


1-278 


S114, r; I125, t; S131, ag; S277, c; I278, cggg;
D279, 138 


292 


20-254 


D1, 19; I254, aaagagg


293 


1-414 


I414, tagcag


300 


1-285 


S16, m; S67, y; I285, baccacggg; D286, 1


349 


23-431 


D1, 22; I118, a; S214, y; I431, caactgg


350 


3-386 


D1, 2; S42, w; I263, c; I386, gggat


368 


3-446 


D1, 2; I446, tctct


385 


1-193 


I35, t; I108, t; I134, r; S135, a; S137, r; S143,
w; I178, c; I193, gagcgggg


411 


6-391 


D1, 5; S17, r; S27, t; S334, y; D392, 244


412 


1-185 


S49, s; S127, s; I185, gctggg; D186, 150
415


2-229 


D1, 1; S3, a; I229, caaatggg


435 


1-386 


S4, s; I386, ccggg


436 


4-472 


D1, 3; S61, sa; D238, 1; S239, s; I472, agtgtgg


437 


1-340 


I340, ggg; D341, 129


441 


1-409


S109, smag; I409, cgcacggg


454 


1-492 


S72, nn; S115, t; S121, bwy; S181, yn; I492,
gagtc 


455 


1-177 


I14, w; I16, a; I177, gagctggg


459 


1-311 


S39, n; S74, rg; I311, accatggg


460 


1-425 


I425, agtac


461 


5-420 


D1, 4; I420, tcgtc


481 


1-429 


I10, w; S262, d; S333, n; I429, ctccaggg


489 


1-414 


D72, 1; S117, n; S396, d; I414, ggaca


496 


1-215 


I215, ttttcggg


501 


1-430 


S275, n; I430, aggat


502 


91-413


D1, 90; I413, aaacgggg


504 


21-420


D1, 20; S47, w; S83, n; I280, n; S281, na;
S292, v; S314, sm; S368, ww; S373, w; I420,
cccca 


505 


18-457 


D1, 17; D36, 1; S182, g; S273, n; S283, a;
S416, bh; I457, ctcga


514 


1-303 


I303, accca


515 


1-455 


S11, t; I12, n; S30, r; S256, wr; I333, t; I455,
cataa 


517 


24-453 


D1, 23; I453, agagcggg


519 


1-275 


I119, gt; S125, w; I129, w; S133, k; S137, k;
S167, k; I275, gcccc


522 


1-313 


I313, agcgtggg


526 


4-366 


I0, t; D1, 3; I366, ggcccggg


530 


1-434 


S328, g; I434, aagat


535 


1-379 


S128, g; S162, m; D380, 5


561 


2-341 


D1, 1; I341, raagagg


568 


1-246 


I118, g; S137, g; I246, aaaccggg


570 


1-207 


I207, ttttt


576 


1-288 


I34, c; I288, cccgtgg


588 


1-390 


S218, a; S224, k; S314, dh; S358, s; D376, 1;
I390, atg; D391, 23


597 


31-274 


D1, 30; S49, n; I274, tccatgg


606 


1-354 


I141, g; D174, 1; S229, rr; D355, 72


627 


1-415


S7, a; I415, cattt


634 


1-178 


D179, 212


640 


6-428


D1, 5; D429, 79


641 


64-483 


D1, 63; I165, d; D183, 1; S185, y; S253, t;
D279, 2; S416, a; I483, atata


655 


1-280 


S58, c; I84, g; S88, k; S204, ac; S244, g; S247,
g; I280, ggg; D281, 90


672 


34-489 


D1, 33; S316, k; S331, k; S333, w; S486, g;
S488, c; D490, 4 


687 


116-473 


D1, 115; S142, n; I473, cctcgggg


697 


1-202 


S142, s; S144, sr; S148, d; S152, d; I155, a;
I164, a; S174, k; I202, gcc; D203, 291


708 


8-384 


D1, 7; S104, b; I384, gaaaa


710 


1-167


S40, k; S49, db; I167, tatct
111


1-191 


I125, c; I191, ttttt


111> 


1-316 


I316, aggg; D317, 157


729 


15-373 


D1, 14; S139, t; I373, cgcag; D374, 99


730 


29-372 


D1, 28; I155, g; S192, ka; S333, d; I372, m;
D373, 93 


731 


1-290 


S10, kk; S30, b; S32, t; S92, t; S197, dy; S278,
g; I290, aggg; D291, 55


732 


8-277 


D1, 7; I113, a; S127, w; I131, s; S132, r; S156,
w; S160, r; S211, n; S215, w; I247, a; D278,
121 


733 


20-375 


D1, 19; S306, sbs; I325, h; S326, nr; S338,
ywd; S344, v; I375, aggg; D376, 68


734 


1-359 


D66, 1; D360, 14 


735 


25-322 


D1, 24; S30, r; I193, a; I322, ccaaggg


736 


9-181 


D1, 8; S58, g; I181, aactaggg


737 


1-160 


S97, ta; I160, aggtc


738 


1-227 


D228, 7 


739 


45-514 


D1, 44; S178, s; I182, c; S436, dmn; S461, v;
S476, c; S506, t; D515, 75 


740 


11-388 


D1, 10; I388, cgacaggg


741 


1-478 


S118, s; S125, a; I126, s; S134, k; S421, vn;
I478, aatsc


742 


217-553 


I0, tt; D1, 216; S286, r; S294, m; S311, r;
S317, s; S338, r; S442, dm; S469, h; S476, r;
S485, s; S491, w; I495, ht; S496, v; S513, r;
D521, 1; S536, m; D554, 199


743 


1-459 


I11, s; S258, m; I270, m; I304, c; I308, amta;
S313, c; S438, v; I459, agggag


744 


25-316 


D1, 24; S315, g; D317, 95


745 


21-283 


D1, 20; I40, g; S41, c; D123, 1; S181, sr; S227,
r; I283, ccgcg; D284, 121


746 


1-256 


D257, 173 


747 


1-179 


S134, w; S138, w; S140, kt; I179, caeca


748 


1-235 


S46, t; I72, t; S189, cc; S222, c; D236, 148


749 


2-370 


D1, 1; S32, eg; D144, 1; S341, g; D371, 76


750 


18-410 


I0, aag; D1, 17; I410, aatcc


751 


22-355 


D1, 21; D148, 1; S150, c; S152, a; S313, n;
D356, 181 


752 


1-139 


S50, t; I118, g; I139, ccet


753 


1-189 


S26, r; S115, s; I121, r; S122, r; S128, s; S143,
r; I146, w; S156, r; D190, 4


754 


1-395 


S212, wd; I395, cggca


755 


19-460 


D1, 18; S26, c; S156, a; S253, n; I460, tagaagg


756 


2-142 


D1, 1; I106, gc; S107, t; S110, c; I142,

ccaccggg 


757 


28-296 


D1, 27; I119, s; I122, t; S128, s; S255, t; S267,
m; D297, 66


758 


11-368 


D1, 10; I200, g; S201, c; S281, d; S317, c;
I368, ccatcggg


759 


19-452 


D1, 18; S421, w; I452, a


760 


25-175 


D1, 24; S34, yk; I175, ccggg; D176, 120


761 


1-212 


I212, cactcggg


762 


1-374 


S320, s; S349, a; D375, 249


763 


8-152 


D1, 7; I152, acKeB; D153, 109
764


1-160 


I127, g; I145, g; I160, cgcccggg


765 


137-313 


D1, 136; S272, m; I279, s; S310, t; I313, ggg;
D314, 203 


766 


1-320 


S278, ag; S281, cagacc; S288, ta; S291, caag; 
S296, c; S317, m; I320, cggg; D321, 306


767 


6-336 


I0, aa; D1, 5; S149, w; S245, y; D337, 137


768 


1-374 


S320, s; D375, 299 


769 


53-435 


D1, 52; S59, b; S344, nnkw; D436, 104


770 


24-448 


D1, 23; S25, g; S411, w; S416, m; D449, 31


771 


1-370 


S3, c; S180, m; S275, r; D371, 122


772


1-388 


I299, c; S326, c; D389, 8


773 


1-143 


S18, c; S66, a; I143, ggg; D144, 274


774 


1-347 


S194, a; S205, c; I347, ggg; D348, 107


775 


5-207 


D1, 4; S111, tg; S158, g; S171, c; S191, a;
S204, a; S206, c; I207, gg; D208, 324


776 


1-368 


I200, c; S201, a; S291, ta; I332, c


777


5-207 


D1, 4; S204, a; S206, c; I207, gg; D208, 262


778 


39-342 


D1, 38; S184, r; D343, 126


779 


4-360 


D1, 3; S13, m; S15, c; S22, s; S24, m; S48, r;
S56, s; S335, c; S345, rs; I360, ggg; D361, 119


780 


1-472 


I347, c; D473, 32


781 


116-426 


D1, 115; S219, m; S424, g; D427, 118


782 


1-391 


S386, k; D392, 64


783 


1-453 


D109, 1; S110, y; S125, y; I128, g; S132, k;
I453, ctctc


784 


29-494 


D1, 28; S72, r; D495, 93


785 


99-461 


D1, 98; S218, r; I461, gaccgggg


786 


2-465 


D1, 1; S8, y; S388, s; I398, g; S400, t; S403,
at; S417, g; D466, 24


787 


28-271 


D1, 27; S99, t; S230, c; S266, ga; S269, c;
I271, g; D272, 126


788 


1-285 


D280, 1; I285, g; D286, 310


789 


1-209 


S205, c; D210, 150


790 


51-297 


D1, 50; I297, ggggg; D298, 539


791 


113-327 


D1, 112; S218, g; I226, g; D280, 1; I327,
cgcaggg; D328, 224 


792 


17-218 


D1, 16; S58, t; S217, t; I218, gggg; D219, 219


793 


11-92 


D1, 10; S91, c; I92, a; D93, 258


794 


9-431 


D1, 8; I431, taaEt


795 


30-341 


D1, 29; I341, a; D342, 175


796 


1-442 


S17, w; S19, wr; D35, 1; S134, t; S264, n;
S322, nr; S369, s; S420, s; S422, y; I442,
tcctcggg 


797 


1-420 


S136, c; S150, c; I245, ccc; I420, ggagtg


798 


25-316 


D1, 24; S315, D317, 97


799 


1-344 


D345, 57 


800 


7-465 


D1, 6; S59, k; S146, a; S186, km; I465, gttca


801 


121-422 


D1, 120; I269, c; S419, cc; I422, gg; D423,
207 


802 


46-477 


D1, 45; S132, bn; I477, actac


803 


15-467 


D1, 14; S45, k; S65, t; S418, ys; D452, 1;
D468, 119 


804 


1-341 


S42, t; S97, d; S326, gtg; S331, tgt; S336, a;
S338, c; I341, cccccggg; D342, 218


805 


2-409 


D1, 1; S334, d; I409, aggg; D410, 161


806 


5-384 


D1, 4; I384, actaa


807 


1-301 


S113, a; S117, c; S123, t; D128, 1; D134, 1;
S282, g; S284, a; I301, gacggagggg; D302, 70


808 


2-314 


D1, 1; S306, g; I314, ggg; D315, 121


809 


1-394 


S53, g; S228, n; S272, vk; I301, g; I358, m;
S368, nb; S375, w; I383, mm; I388, yt; I394,

nhaccggg 


810 


6-205 


I0, a; D1, 5; I141, t; I205, ggg; D206, 630


811 


6-270 


D1, 5; I270, gggg; D271, 115


1600 


1-247 


S45, m; S114, k; I122, m; S123, yc; S158, rr;
S221, k; I247, ccccaggg


1601 


1-225 


S109, bm; S195, m; I225, tgcacggg


1602 


23-245 


D1, 22; D138, 1; S139, s; S242, t; S244, g;
I245, g; D246, 13


1603 


1-303 


S71, c; D277, 1; I303, ggagggg; D304, 38


1604 


1-242 


S47, w; S50, c; S81, h; S85, d; S91, k; S106, r;
I242, tgtggg; D243, 50


1605 


2-225 


D1, 1; S20, k; S91, c; I225, ggg; D226, 132


1606 


15-293 


D1, 14; S156, g; S193, g; I200, t; I293,

acaaaggg 


1607 


1-361 


S323, c; I361, cccca


1608 


1-151 


I151, taagggg; D152, 154


1609 


1-242 


S55, s; I135, a; S152, h; I242, cagtaggg


1610 


1-196 


I151, w; S190, k; I196, cctgtgg


1611 


1-228 


S115, k; S174, rk; I228, cgtttggg


1612 


1-221 


S108, v; I221, tgatcggg


1613 


1-281 


I66, w; I137, a; D282, 79


1614 


1-171 


S53, k; S76, k; I80, k; S81, kw; S86, r; S92, k;
S126, k; I171, gccgagg


1615 


2-193 


D1, 1; S67, c; I121, s; S122, mm; S126, g;
S130, r; S146, r; S156, gm; I193, cctca


1616 


1-349 


S251, ww; S259, rs; S275, k; I279, w; S285, y;
S292, y; I320, m; I331, m; I338, w; I341, s;
I349, accccggg


1617 


1-129 


I118, t; D130, 26


1618 


1-184 


D9, 1; D185, 1


1619 


1-169 


I122, t; I169, gcccaggg


1620 


1-187 


S106, k; S118, m; S122, eg; S132, k; D188, 59


1621 


1-153 


D125, 1; I131, ttt; S152, t; I153, gg; D154, 127


1622 


1-400 


S43, s; I126, g; I129, y; S353, d; I400, tatat



EXAMPLE 16 

Categorization of 5'ESTs and Consensus Contigated 5'ESTs
The nucleic acid sequences of the present invention (SEQ ID NOs. 24-811 and 1600-1622) were
grouped based on their homology to known sequences as follows. All sequences were compared to
EMBL release 57 and daily releases available at the time of filing using BLASTN. All matches with a
minimum of 25 nucleotides with 90% homology were retrieved and used to compute Tables IV and V.
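
The retrieval threshold described above can be sketched as a simple filter. This is an illustration only: the `Hit` record and its field names are hypothetical, no particular BLASTN output format is assumed, and "90% homology" is interpreted here as at least 90% identical positions over the aligned length.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Hit:
    """Hypothetical summary of one database match."""
    subject_id: str
    alignment_length: int  # number of aligned nucleotides
    identities: int        # number of identical aligned positions

    @property
    def percent_identity(self) -> float:
        return 100.0 * self.identities / self.alignment_length


def retained_matches(hits: Iterable[Hit],
                     min_length: int = 25,
                     min_identity: float = 90.0) -> List[Hit]:
    """Keep matches of at least min_length nucleotides with at least min_identity percent identity."""
    return [h for h in hits
            if h.alignment_length >= min_length and h.percent_identity >= min_identity]
```

For instance, a 30-nucleotide match with 28 identities passes both thresholds, while a perfect 24-nucleotide match is too short to be retained.
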
In some embodiments, the nucleic acid sequences of the 5'ESTs or consensus contigated 5'ESTs do not
match any known vertebrate sequence or any publicly available EST sequence, and are thus completely
new.

In other embodiments, 5'ESTs or consensus contigated 5'ESTs match a known sequence.
Tables III and IV give, for each sequence of the invention in this category, referred to by its sequence
identification number in the first column, the positions of its preferred fragments in the second column
entitled "Positions of preferred fragments." As used herein, the term "polynucleotide described in Table
III" refers to all of the preferred polynucleotide fragments defined in Table III in this manner, and the
term "polynucleotide described in Table IV" refers to all of the preferred polynucleotide fragments
defined in Table IV in this manner. The present invention encompasses isolated, purified, or recombinant
nucleic acids which consist of, consist essentially of, or comprise a contiguous span of at least 8, 10, 12,
15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, or 500 nucleotides in length, to the extent that a
contiguous span of these lengths is consistent with the lengths of the particular polynucleotide, of a
polynucleotide described in Table III or Table IV, or a sequence complementary thereto, wherein said
polynucleotide described in Table III or Table IV is selected individually or in any combination from the
polynucleotides described in Table III or Table IV. The present invention also encompasses isolated,
purified, or recombinant nucleic acids which consist of or consist essentially of a polynucleotide
described in Table III or Table IV, or a sequence complementary thereto, wherein said polynucleotide is
selected individually or in any combination from the polynucleotides described in Table III or Table IV.
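
A "positions of preferred fragments" cell may list several ranges, for example "130-242, 283-315, 365-461". The sketch below shows one way such a cell could be read; the function names are hypothetical, and positions are assumed to be 1-based and inclusive, as in the worked examples for Table II.

```python
from typing import List, Tuple


def parse_positions(field: str) -> List[Tuple[int, int]]:
    """Parse a cell such as '130-242, 283-315, 365-461' into (start, end) pairs."""
    ranges = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start < 1 or end < start:
            raise ValueError(f"invalid range: {part.strip()!r}")
        ranges.append((start, end))
    return ranges


def extract_fragments(seq: str, ranges: List[Tuple[int, int]]) -> List[str]:
    """Return the subsequence for each 1-based inclusive (start, end) range."""
    return [seq[start - 1:end] for start, end in ranges]
```

Each returned fragment has length `end - start + 1`, which is the length against which the recited contiguous span lengths would be compared.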


Table III



SEQ ID NO.


Positions of preferred 
fragments 


24 


1-251 


25 


1-83 


28 


227-276 


29 


1-27 


30 


130-242, 283-315, 365-461 


32 


314-399 


33 


89-321 


34 


1-38 


35 


1-52, 171-222 


36 


1-30, 408-441


37 


1-138 


39 


115-140 


40 


1-97 


41 


1-112 


42 


1-177 


46 


1-38 


48 


376-400 


51 


400-466 


54 


1-259 


55 


189-320 
56


265-457 


58 


246-469 


59 


81-123,418-444 


60 


1-348 


61 


78-123,418-457 


62 


386-439 


63 


1-214 


64 


109-297 


65 


1-370 


66 


92-428 


68 


1-180 


69 


165-259 


70 


1-178 


71 


1-27 


72 


1-179 


73 


1-65, 107-192 


75 


1-314 


77 


263-388 


78 


1-64 


79 


1-149 


80 


101-142, 302-380 


82 


1-192 


83 


1-398 


85 


1-290 


86 


1-118, 149-336 


87 


1-262 


88 


1-149 


89 


1-315 


90 


1-74 


91 


1-335, 364-423 


92 


1-316 


93 


338-508 


94 


179-321 


95 


219-402 


96 


26-315 


97 


348-460 


98 


1-230 


99 


391-467 


101 


214-336 


102 


1-289 


103 


1-383 


104 


1-211 


105 


1-36 


106 


1-126 


107 


1-49 


108 


294-336 


109 


1-128 


111 


1-154 


112 


407-441 


113 


1-80, 139-184 


114 


10-79 


116 


1-292 


117 


1-304 
119


1-288 


120 


2-348 


121 


1-122 


123 


188-353 


124 


1-249 


125 


295-375 


128 


1-244 


129 


1-232 


130 


196-312 


131 


178-276 


132 


37-174 


133 


1-344 


134 


1-244 


135 


1-217 


136 


82-428 


137 


1-29, 103-155,274-434 


138 


1-395 


139 


1-268 


140 


1-170 


141 


1-396 


142 


1-73, 227-357 


143 


1-159 


144 


1-433 


145 


61-116 


146 


1-71, 179-205 


147 


177-300 


149 


1-146 


151 


1-166 


152 


1-382 


153 


1-208 


154 


121-251 


155 


1-147 


157 


1-115 


158 


1-175 


159 


1-44, 80-230 


160 


1-346 


161 


1-277 


162 


1-235 


163 


1-34 


164 


1-195 


165 


19-78, 175-217 


166 


1-209 


167 


1-65 


168 


128-218 


169 


49-245 


170 


179-280 


171 


1-103 


172 


1-218 


173 


1-380 


174 


1-139 


175 


1-122 


176 


1-300 


177 


M66 
179


1-86 


180 


1-245 


181 


1-241 


182 


1-263 


183 


1-170 


184 


58-106, 399-443 


185 


1-427 


186 


1-365 


187 


1-260 


188 


1-172 


189 


1-150 


190 


161-271,301-339 


191 


1-91 


192 


1-264 


193 


1-246 


194 


1-150 


195 


1-209 


196 


1-363 


197 


1-155 


198 


1-135 


200 


1-125 


201 


1-210 


202 


1-338 


203 


1-188 


204 


228-347 


205 


1-440 


206 


56-221 


208 


1-422 


209 


169-195 


210 


1-363 


211 


1-368 


212 


1-448 


213 


1-134 


214 


1-193 


215 


1-214 


216 


1-134 


218 


1-189 


219 


1-248 


220 


1-115 


221 


1-113 


222 


1-370 


224 


1-251 


225 


1-198 


226 


45-141 


227 


1-206 


228 


1-480 


229 


1-144 


230 


1-42, 281-351,432-457 


231 


1-112 


233 


1-301 


234 


1-109 


235 


1-393 


236 


1-222 
237


1-154 


238 


1-439 


239 


112-137


240 


1-194 


241 


1-44 


242 


1-242 


244 


1-324 


245 


1-38, 217-280 


246 


1-60 


247 


77-359 


248 


1-236 


249 


1-342 


250 


80-382 


251 


1-303 


252 


62-259 


253 


1-165 


254 


1-328 


255 


1-320 


256 


1-305 


257 


1-181 


258 


116-174 


259 


1-265 


260 


1-272 


261 


1-62 


263 


1-371 


266 


1-274 


267 


1-342 


268 


364-427 


269 


31-143 


270 


1-79 


271 


1-121 


272 


229-292 


273 


1-158 


274 


1-113 


275 


1-254 


276 


1-333 


277 


1-130 


278 


1-184 


279 


1-265 


280 


1-188 


281 


1-177 


282 


1-336 


283 


1-294 


284 


1-171 


285 


1-297 


288 


1-42 


290 


1-170 


292 


20-155 


294 


1-334 


295 


1-375 


296 


1-226 


297 


1-232 


299 


40-139 
300


1-285 


301 


1-242 


302 


1-136 


303 


1-175 


304 


1-493 


305 


1-214 


306 


89-458


307 


1-328 


308 


1-380 


309 


1-236 


310 


1-357 


311 


1-470 


312 


1-187 


313 


1-159 


315 


1-162 


316 


1-404 


317 


1-450 


318 


1-395 


319 


1-257 


320 


56-325 


321 


1-201 


322 


1-159 


323 


1-420 


324 


1-210 


325 


1-192 


326 


88-181 


327 


1-185 


328 


128-210 


330 


1-223 


331 


1-362 


332 


1-89 


334 


1-188 


335 


1-115 


336 


1-300 


337 


1-307 


338 


1-123 


339 


1-297 


340 


1-34 


341 


1-44 


342 


1-37 


343 


141-169 


344 


1-112 


345 


1-235, 266-349 


346 


1-191 


347 


1-229 


348 


1-210 


350 


139-266 


351 


1-307 


352 


1-170 


353 


1-293 


354 


30-161, 192-331


355 


1-93 


356 


1-178 
357


1-107 


358 


1-29, 168-209 


359 


1-298 


360 


1-193 


362 


1-360 


363 


1-45, 100-212 


364 


39-170,202-242 


365 


1-248 


366 


1-351 


367 


1-208 


368 


228-446 


369 


1-62 


370 


1-132 


371 


1-127 


372 


1-196 


373 


1-148 


374 


1-126 


375 


1-112 


376 


1-146 


378 


1-143 


379 


1-261 


380 


202-228 


382 


1-151 


383 


1-45 


384 


1-190, 250-456 


385 


1-55, 141-181 


386 


1-281 


387 


1-111 


388 


1-374 


389 


1-192 


390 


1-371 


392 


1-303 


394 


1-126 


395 


1-329 


396 


1-99 


397 


1-316 


398 


1-251 


399 


1-120 


401 


1-206 


402 


1-330 


403 


1-311 


405 


1-153 


406 


1-206 


407 


1-479 


408 


1-289 


410 


229-321 


413 


1-158 


415 


95-229 


416 


1-265 


417 


1-228 


418 


1-225 


419 


207-293 


420 


1-194 
421


1-90 


422 


1-161 


423 


1-420 


424 


1-432 


425 


1-276, 309-419 


426 


1-232 


427 


1-81 


428 


1-96 


429 


1-165 


431 


1-58, 186-237, 327-354 


433 


1-65 


434 


1-83 


435 


1-386 


436 


405-447 


438 


1-106 


439 


45-105, 168-255,284-447 


441 


1-409 


442 


1-320 


443 


1-256 


444 


1-284 


445 


1-240 


446 


1-149 


447 


1-360 


448 


1-123 


449 


1-94 


450 


1-302 


452 


1-349 


453 


1-270 


454 


M92 


455 


17-105 


456 


1-102 


457 


1-108 


458 


1-285 


459 


1-311 


460 


1-191 


461 


312-420 


462 


1-257 


463 


1-117 


464 


1-142 


466 


1-235 


467 


1-29 


468 


1-41 


469 


1-438 


470 


1-131 


471 


1-211 


472 


1-150 


473 


1-352 


474 


1-141 


476 


1-232 


478 


1-201 


479 


1-151 


480 


1-104 


481 


7-429 
482


1-385 


486 


1-226 


488 


1-296 


489 


1-72, 323-377 


491 


1-348 


492 


33-126 


493 


1-300 


494 


1-295 


495 


1-244 


496 


1-215 


497 


1-255 


499 


1-174, 384-474 


500 


1-50, 102-241 


501 


153-430 


502 


91-132 


503 


1-64 


504 


21-63, 356-420 


505 


37-68, 187-234 


506 


1-315 


507 


101-208 


510 


1-402 


511 


1-343 


512 


1-140, 170-246,276-420 


513 


1-324 


514 


1-303 


515 


13-340 


516 


1-263, 293-360 


518 


1-245 


519 


111-275 


520 


62-182 


521 


1-218 


523 


1-502 


524 


1-118 


525 


1-276 


526 


223-366 


527 


M28 


528 


297-342 


529 


1-244 


530 


1-88, 375-434 


531 


1-406 


533 


1-149 


534 


1-145 


535 


1-116 


536 


1-207 


537 


1-394 


538 


1-415 


539 


1-160 


540 


1-327 


541 


1-38, 73-396 


542 


1-247 


543 


1-221 


544 


1-375 


545 


1-376 
546


1-109 


547 


1-160, 223-306 


548 


1-148 


551 


1-231 


552 


1-229 


553 


1-232 


554 


1-141 


555 


1-376 


556 


1-279 


557 


1-340 


558 


1-51 


559 


1-354 


562 


1-188 


563 


1-229 


564 


184-352 


566 


308-341 


567 


1-218 


568 


1-79 


569 


1-142 


570 


1-207 


571 


1-373 


572 


1-195 


573 


1-352 


574 


1-121 


575 


1-222 


576 


151-288 


577 


1-264 


578 


1-205 


580 


1-171,273-328 


581 


1-356 


582 


1-239 


583 


1-144 


584 


1-282 


585 


1-338 


586 


1-436 


588 


1-380 


589 


1-60 


590 


1-178 


592 


1-66 


593 


1-215 


594 


1-161 


596 


1-407 


597 


31-83 


598 


1-417 


599 


1-329 


600 


1-311 


601 


1-61,99-214 


602 


1-154, 197-463 


603 


135-269 


604 


1-351 


605 


1-195 


608 


1-357 


609 


1-201 
612


1-176 


613 


1-342 


615 


1-272 


616 


1-114 


617 


1-46 


618 


1-208 


619 


1-257 


620 


1-28 


621 


1-26 


622 


1-221 


623 


1-432 


624 


1-233 


625 


1-26 


627 


M3 


628 


1-318 


629 


1-170 


630 


1-196 


631 


248-339 


632 


1-433 


633 


1-154 


634 


1-41 


635 


1-137 


636 


1-172 


637 


1-253 


638 


1-185 


639 


1-206 


641 


334-483


642 


1-309 


643 


1-75, 162-213 


644 


107-211 


645 


1-98 


646 


1-347 


647 


1-49,81-143 


648 


1-232 


649 


74-133 


650 


1-37 


651 


1-276 


652 


1-170 


653 


1-178 


654 


1-121 


656 


1-197 


657 


1-246 


659 


1-197 


660 


116-172 


661 


1-411 


662 


1-146 


663 


1-65 


664 


1-182 


665 


1-320 


666 


1-273 


667 


1-149 


668 


1-122 


670 


1-160 
671


1-137 


673 


1-263 


674


1-263 


675 


1-107 


677


1-441 


678 


134-191 


679 


1-235 


680 


1-26 


682 


1-58, 269-328 


683 


1-447 


684 


1-217 


685 


1-132 


686 


1-60 


688 


1-107 


689 


132-221, 327-377 


690 


1-388 


691 


1-141, 171-408 


692 


1-322 


693 


1-153 


695 


1-455 


698 


1-58, 117-174 


699 


240-300 


700 


1-159 


701 


1-69 


702 


1-175 


703 


1-298 


704 


1-136 


705 


1-168 


706 


1-419 


707 


1-382 


708 


8-245, 296-384 


709 


1-149 


710 


1-167 


711 


1-35 


712 


1-80, 116-156, 206-241 


713 


33-376 


714 


1-304 


715 


1-242 


717 


1-145 


718 


1-350 


720 


1-257 


721 


1-360 


722 


1-191 


724 


1-139 


726 


1-207 


727 


99-164 


728 


1-321 


730 


156-372 


731 


1-109, 256-290 


735 


25-192 


737 


1-160 


738 


1-227 


739 


441-514 
742


217-280 


743 


10-275 


747 


1-179 


749 


2-31, 139-168 


750 


349-410 


752 


1-119 


753 


1-121 


754 


1-28 


760 


25-175 


761 


1-212 


763 


8-75 


766 


1-59, 102-248, 295-320 


769 


53-85 


771 


1-370 


774 


1-347 


776 


1-200 


778 


39-342 


779 


4-28 


780 


1-49, 407-472


781 


116-426 


782 


1-59 


783 


1-53,219-453 


784 


29-53,219-263,426-494 


785 


99-347, 386-461 


786 


2-28 


788 


1-279 


789 


1-58 


790 


226-268 


792 


129-218 


794 


265-431 


796 


5-86 


797 


1-34 


799 


1-344 


802 


46-477 


806 


64-384 


807 


135-301 


808 


2-314 


810 


6-39 


1600 


1-25 


1601 


1-225 


1602 


23-139 


1603 


1-294 


1606 


15-44 


1607 


1-361 


1611 


85-228 


1612 


1-221 


1613 


138-281 


1614 


65-171 


1615 


2-142 


1616 


1-46 


1617 


1-95 


1620 


1-187 


1621 


1-136 
1622



32-280,311-400 



Table IV 





Positions of preferred fragments




1-52 


41 






1-47 




I J J 




400-428 




0-? 1 '-r^ 










114 


1-79 


120 


1-386 


121 


1-190 


124 


1-249 


125 


295-328 


139 


1-81, 125-268 


159 


1-139, 180-230


165 


1-78 


170 


179-205, 248-280


194 


1-150 


213 


1-158 


247 


1-104, 155-183, 280-359


269 


31-143 


350 


139-386 


368 


228-446 


385 


1-72, 143-193 


415 


95-229 


435 


1-386 


436 


446-472 


441 


1-361 


454 


1-349 


455 


1-105 


459 


35-161,200-311 


460 


1-26, 56-140 


481 


1-429 


489 


1-84 


496 


\JiA, 84-215 


501 


153-430 


502 


1-91 


504 


1-63 
505


1-68 


514 


1-303 


515 


237-351 


519 


1-145 


526 


231-366 


530 


1-88 


535 


1-55 


570 


76-207 


576 


168-218, 261-288 


588 


1-331 


597 


1-83 


627 


M3 


634 


1-41 


641 


1-55, 334-483 


672 


1-34 


687 


1-129 


708 


1-245,296-384 


710 


1-26, 104-167 


722 


1-191 


730 


1-465 


731 


1-43 


735 


1-91 


737 


1-160 


738 


1-186 


739 


1-48 


742 


1-62, 99-248 


743 


1-315, 412-459


744 


1-31 


747 


1-63 


749 


1-32 


750 


1-38 


752 


1-139 


753 


1-193 


754 


1-28 


759 


1-38 


760 


1-115 


763 


1-62 


765 


1-126 


769 


1-85 


770 


1-40 


771 


1-148 


774 


1-134 


775 


265-531 


776 


71-203 


777


333-469 


778 


144-468 


779 


1-28 


780 


1-49 


781 


1-102 


782 


1-59 


783 


1-53 


784 


1-220, 262-390 


785 


1-339,408-461 
786


1-28 


789 


1-58 


791 


1-126 


792 


1-31, 129-220 


793 


1-31 


794 


355-431 


795 


1-33 


797 


1-31 


798 


1-31 


799 


1-401 


801 


1-117 


802 


1-92 


806 


64-384 


807 


1-331 


808 


1-351 


810 


1-39 


1600 


1-25 


1603 


1-341 


1606 


1-31 


1607 


1-361 


1608 


164-305 


1611 


85-228 


1612 


1-221 


1613 


112-360 


1614 


1-171 


1615 


94-193 


1617 


1-155 


1620 


1-246 



III. Evaluation of Spatial and Temporal Expression of mRNAs Corresponding to the 5'ESTs,
Consensus Contigated 5'ESTs, or EST-related nucleic acids


EXAMPLE 17 

Expression Patterns of mRNAs From Which the 5'ESTs Were Obtained
Each of the SEQ ID NOs. 24-811 and 1600-1622 was also categorized based on the tissue from
which its corresponding mRNA was obtained, as follows.
Table V shows the spatial distribution of each nucleic acid sequence of the invention (SEQ ID
NOs. 24-811 and 1600-1622), referred to by its sequence identification number in the first column. In the
second column, entitled "tissue distribution," the spatial distribution is represented by the number of
individual 5'ESTs used to assemble the consensus contigated 5'ESTs for a given tissue. Each type of
tissue listed in Table V is encoded by a letter. The correspondence between the letter code and the tissue
type is given in Table VI.
Table V

SEQ ID NO  Tissue Distribution
24    AA:1
25    S:1
26    P:1
27    W:1




P-1 

J. • 1 








1.1 


O i 


r. 1 




P'l 

r. 1 




P-1 


1A 




1^ 


/^.o. p.i. c.i. w-i. AA-4 




P-1 

Jr. 1 


n 


<sl 


JO 


ni 


jy 


p*i 


Aft 


ARl 




R*l C-3- F l- G l- H-4- S*2* T S* W l- Z l' AA:3: AC:1: AD:3 




A-1 






44    P:1
45    r-2- K l* O l- S-5
46    K:1; S:2; AA:1
47    AA:1
48    C:1; O:1; P:8
49    P:1
50    P:1
51    P:1
52    S:1
53    AA:1
54    T:1
55    P:1
56    P:1
57    P:1
58    P:1
59    P:7; T:2; Z:1
60    R:1
61    C:1
62    P:1
63    F:1
64    AA:1
65    F:1
66    P:4; T:2; Z:1
67    S:1
68    AA:1
69    P:1
70    P:1
71    S:1
72    W:1
73    G:1
74    P:1
75    N:1
76    P:1
77    S:1
78    U:1
79    B:1
80    P:1
81    AC:1
82    K:1; O:1
83    G:1
84    C:1; K:2; P:29; S:2; T:1; X:2; Y:1; AA:2
85    K:1
86    C:1
87    F:1
88    AB:1
89    H:1
90    M:1
91    B:1
92    K:1
93    AC:2
94    P:1
95    M:1
96    Z:2
97    K:1; P:11; S:1; X:1; AA:1
98    W:1
99    X:1
100   P:1
101   AB:1
102   F:1
103   AA:1
104   K:1
105   B:4; C:6; E:2; H:3; O:2; Q:1; S:3; AC:2
106   T:1
107   O:1
108   P:1
109   G:1
110   AA:1
111   T:1
112   P:1
113   F:1



wo 99/53051 



PCT/IB99/00712 



55 



114   B:3; C:4; K:5; S:4; Y:1
115   U:1
116   W:1
117   T:1
118   T:2
119   T:1
120   H:3
121   AA:3
122   K:1
123   H:2
124   AA:2
125   B:1; G:1; J:3; T:13; Y:5; AA:5; AD:2
126   H:1; P:1
127   K:1
128   F:1
129   G:1
130   P:1
131   B:1
132   AA:1
133   W:1
134   P:1
135   K:1
136   B:1; C:1
137   B:1
138   H:1
139   AC:2
140   T:1
141   B:1
142   H:1
143   T:1
144   H:1
145   B:1
146   R:1
147   P:1
148   C:1; H:2; O:1; S:2; T:1; AC:2
149   H:1
150   AA:1
151   W:1
152   S:1
153   F:1
154   M:1
155   B:1
156   R:1
157   W:1
158   T:1
159   C:1; AA:1
160   F:1
161   H:1






162   D:1
163   AA:1
164   AA:1
165   W:3
166   AA:1
167   W:1
168   F:1
169   B:1
170   G:2
171   E:1
172   B:1
173   F:1
174   B:1
175   W:1
176   K:1
177   AA:1
178   S:1
179   K:1
180   AA:1
181   W:1
182   K:1
183   T:1
184   P:1
185   B:1
186   W:1
187   R:1
188   T:1
189   T:1
190   W:1
191   A:1
192   F:1
193   B:1
194   G:3
195   W:1
196   O:1
197   T:1
198   O:1
199   B:1
200   AA:1
201   G:1
202   B:1
203   O:1
204   P:1
205   AA:1
206   Y:1
207   Y:1
208   AA:1
209   G:1






210   H:1
211   C:1
212   H:1
213   W:2
214   Y:1
215   AB:1
216   K:1
217   M:1
218   AD:1
219   A:1
220   AA:1
221   G:1
222   G:1
223   G:1; H:2; S:2; X:1
224   G:1
225   G:1
226   B:1
227   P:1
228   O:1
229   G:1
230   T:1
231   T:1
232   K:1
233   S:1
234   O:1
235   F:1
236   T:1
237   B:1
238   W:1
239   G:1
240   R:1
241   A:1
242   W:1
243   P:1
244   H:1
245   D:1
246   C:1
247   B:2
248   P:1
249   F:1
250   AB:1
251   W:1
252   H:1
253   B:1
254   S:1
255   T:1
256   W:1
257   T:1






258   AA:2
259   P:1
260   W:1
261   H:1
262   K:1
263   K:1
264   C:1; E:1; F:1; I:4; L:1; N:22; O:1; P:1; S:1; T:9; AA:1
265   A:1
266   T:1
267   K:1
268   H:1
269   T:2
270   T:1
271   T:1
272   B:1
273   Y:1
274   T:1
275   G:1
276   AA:1
277   T:1
278   AB:1
279   T:1
280   W:1
281   F:1
282   K:1
283   H:1
284   O:1
285   W:1
286   B:21; C:7; H:5; K:5; O:8; S:16; W:1; Y:3; Z:4; AA:2; AC:1
287   K:2; P:12; W:1; AC:2
288   S:1
289   K:2; P:8; W:1; AC:2
290   S:1
291   H:1
292   B:11; C:2; E:1; H:7; K:1; N:3; S:1; T:8; W:1; AA:28; AC:1
293   B:6; C:3; G:1; H:6; K:4; N:4; O:3; Q:2; S:5; T:1; U:1; V:2; Y:3; AA:1
294   B:1
295   H:1
296   AA:1
297   T:1
298   T:1
299   T:1
300   H:1; S:1
301   H:1
302   W:1
303   W:1
304   H:1
305   G:1






306   K:1
307   H:1
308   A:1
309   H:1
310   H:1
311   Y:1
312   G:1
313   H:1
314   K:1
315   Y:1
316   P:1
317   H:1
318   AA:1
319   H:1
320   O:1
321   Y:1
322   B:1
323   P:1
324   P:1
325   K:1
326   H:1
327   H:1
328   Q:1
329   S:1
330   B:1
331   T:1
332   T:1
333   B:1
334   T:1
335   W:1
336   P:1
337   A:1
338   AA:1
339   AA:1
340   G:1
341   C:1
342   K:1
343   S:1
344   G:1
345   B:1
346   Y:1
347   O:1
348   F:1
349   AA:5
350   B:15; C:1; G:1; H:1; O:1; Q:2; S:1; X:1; Y:1
351   F:1
352   R:1
353   O:1






354   H:1
355   W:1
356   F:1
357   T:1
358   S:1
359   X:1
360   T:1
361   K:1
362   K:1
363   G:1
364   K:1
365   G:1
366   AA:1
367   F:1
368   C:2; H:2; X:1
369   E:1
370   T:1
371   H:1
372   G:1
373   AA:1
374   O:1
375   F:1
376   F:1
377   R:1
378   AA:1
379   AA:1
380   C:1
381   H:1
382   T:1
383   W:1
384   S:1
385   AA:2
386   D:1
387   O:1
388   W:1
389   F:1
390   W:1
391   K:1
392   W:1
393   K:1
394   T:1
395   H:1
396   T:1
397   T:1
398   G:1
399   C:1
400   K:1
401   B:1






402   H:1
403   B:1
404   B:1
405   H:1
406   AB:1
407   O:1
408   P:1
409   X:1
410   H:1
411   B:9; C:3; K:3; L:2; O:1; S:2; X:1; AA:1
412   G:1; S:2; V:2; W:1; Y:1; Z:1
413   W:1
414   G:1
415   B:3; C:3; F:1; G:2; H:4; J:1; K:1; O:1; P:3; S:1; V:1
416   I:1
417   F:1
418   F:1
419   F:1
420   AA:1
421   F:1
422   T:1
423   P:1
424   B:1
425   Y:1
426   W:1
427   AA:1
428   W:1
429   H:1
430   Y:1
431   J:1
432   AA:1
433   O:1
434   AA:1
435   B:3; H:1
436   B:9; 0:4; H:8; K:2; O:2; W:1; Z:2; AA:2; AD:3
437   H:1; T:1
438   T:1
439   R:1
440   M:1
441   H:2
442   W:1
443   B:1
444   W:1
445   AB:1
446   F:1
447   AD:1
448   AB:1
449   N:1






450   T:1
451   W:1
452   O:1
453   AA:1
454   D:28
455   W:1
456   T:1
457   G:1
458   W:1
459   Y:4
460   B:3
461   P:2
462   K:1
463   T:1
464   H:1
465   G:1
466   AC:1
467   R:1
468   S:1
469   B:1
470   S:1
471   T:1
472   AA:1
473   W:1
474   T:1
475   S:1
476   T:1
477   AA:1
478   G:1
479   W:1
480   B:1
481   O:2
482   K:1
483   P:1
484   W:1
485   P:1
486   B:1
487   Y:1
488   H:1
489   P:1; Q:1; S:3
490   C:1
491   S:1
492   H:1
493   B:1
494   H:1
495   O:1
496   N:2
497   B:1






498   G:1
499   P:1
500   G:1
501   C:1; K:1; Q:1
502   B:4
503   R:1
504   B:5; H:2; W:2
505   G:2; H:1
506   W:1
507   B:1
508   W:1
509   AB:1
510   H:1
511   N:1
512   J:1
513   AA:1
514   T:2
515   AA:5
516   F:1
517   C:1; O:1
518   W:1
519   T:4
520   B:1
521   H:1
522   H:2; T:3
523   H:1
524   AA:1
525   W:1
526   C:2; E:1; J:1; R:3; S:4; AA:1
527   H:1
528   S:1
529   P:1
530   B:1; H:1
531   O:1
532   Y:1
533   H:1
534   T:1
535   T:2
536   B:1
537   ADA
538   AAA
539   T:1
540   F:1
541   AD:1
542   W:1
543   W:1
544   F:1
545   T:1






546   F:1
547   K:1
548   Y:1
549   S:1
550   B:1
551   B:1
552   B:1
553   H:1
554   P:1
555   G:1
556   H:1
557   K:1
558   B:1
559   R:1
560   AB:1
561   C:1; S:1; V:1
562   AA:1
563   K:1
564   P:1
565   K:1
566   G:1
567   W:1
568   E:1; W:2
569   W:1
570   B:2
571   O:1
572   T:1
573   B:1
574   T:1
575   B:1
576   B:3
577   B:1
578   X:1
579   H:1
580   AA:1
581   AA:1
582   AA:1
583   AA:1
584   AA:1
585   D:1
586   H:1
587   H:1
588   AA:3
589   K:1
590   W:1
591   K:1
592   W:1
593   B:1






594   V:1
595   R:1
596   P:1
597   G:1; X:2; Z:1
598   X:1
599   F:1
600   F:1
601   Y:1
602   F:1
603   W:1
604   H:1
605   G:1
606   C:2; H:1; S:3; W:2; AD:3
607   W:1
608   C:1
609   F:1
610   K:1
611   M:1
612   AD:1
613   H:1
614   T:1
615   H:1
616   F:1
617   T:1
618   G:1
619   O:1
620   B:1
621   W:1
622   W:1
623   T:1
624   AA:1
625   G:1
626   M:1
627   C:2; T:2; W:1; Y:1
628   T:1
629   J:1
630   T:1
631   P:1
632   H:1
633   H:1
634   C:1; S:1; T:1; AD:1
635   J:1
636   G:1
637   W:1
638   AA:1
639   W:1
640   B:6; C:3; G:1; H:2; K:6; OA; Q:1; R:2; S:1; T:3; Y:3; Z:2; AA:2; AC:2; AD:3






641   B:21; C:2; G:5; W:4; Y:1
642   AA:1
643   P:1
644   AA:1
645   T:1
646   K:1
647   F:1
648   F:1
649   F:1
650   T:1
651   W:1
652   T:1
653   T:1
654   P:1
655   B:1; H:2; N:1; T:3; Y:1
656   B:1
657   T:1
658   R:1
659   K:1
660   W:1
661   AA:1
662   Y:1
663   W:1
664   G:1
665   S:1
666   Y:1
667   F:1
668   T:1
669   B:1
670   F:1
671   T:1
672   A:2; B:6; C:1; G:1; H:3; J:1; L:1; P:2; Q:1; S:4; T:1; V:3; W:2; Y:1; AA:3; AD:2
673   T:1
674   G:1
675   F:1
676   M:1
677   G:1
678   Y:1
679   D:1
680   P:1
681   D:1
682   AA:1
683   G:1
684   K:1
685   G:1
686   P:1
687   B:3; C:2; D:2; E:2; J:4; V:2; AC:6






/COO 

ooo 


A A • 1 

AA: I 


ooV 




DVU 


A A • 1 

AA.l 


<01 
0^1 


"LI* 1 


oyZ 


A A • 1 


iCOl 






Ar>. 1 




1 .1 


oyo 


rl. 1 


nan 


0./1. v*^^ p« 1 • T»'>» ^.o 
r>.4, li.l, r.l, .1, 1-^, 


mo 


U.i 


byy 


117. 1 


7UU 


C. 1 


701 


r\. 1 
0:1 


702   B:1
703   AB:1
704   H:1
705   B:1
706   T I . 1
707   G:1
708   F:1; H:1; K:1; W:2; AA:1
709   H:1
710   T:2
711   C:1
712   G:1
713   Y:1
714   C:1
715   Y:1
716   Z:1
717   P:1


TIG 

718 


Cj:1 


1 1 A 

719 


S:l 


720 


V.I 


721 


X>f * 1 


111 


1 :z 


723 


U:l, r.z, 0.2 


/24 


X.I 




T' 1 


726 




727 


1:1 


728 


T:l 


TOO 


r^.T* u-o- i^-o- AZ-i- Ar^-i 


730   B:7; H:2; Y:1
731   B:5; W:3
732   B:1; C:2; G:2; S:2; AA:9
733   B:6; C:2; G:1; H:10; O:2; P:6; Q:1; S:2; W:4; AC:2
734   B:6; O:1; V:1
735   C:1; O:2






/OO 


R'l' M-l* T''^* Y'l 


fj f 




/JO 






^'V r-R- D l- F-6- fi-l- H-11- M- J l- N l- 0*3- P12* 0'3' S-2* T'2' 
W:l; AC:1; AD:8 


740   H:2; Y:1
741   C:2; H:1
742   B:12; C:1; G:1; H:4; K:2; O:2; S:4; T:2; Y:2
743   AA:4
744   B:1; G:1; H:6; T:1; W:1
745   C:7; E:1; G:3; H:2; P:2; S:2; T:1; W:1; AD:2
746   O:2; S:1
747   T:2
748   S:3
749   H:1; O:2; S:2
750   Y:1; AD:1
751   B:8; G:2; H:2; I:1; Q:2; S:2; T:1; W:2
752   T:3
753   P:4
754   B:1; H:2
755   B:7; C:1; G:6; H:2; K:1; U:2; V:1; Z:1
756   C:1; H:1; J:2; O:2; S:1; T:2; W:1; AA:1
757   B:1; C:1; K:3; S:1; V:1; Y:1
758   E l- H'2' K l* P I* 01- AD'S
759   B:6; C:1; Y:1
760   B:4
761   W:2
762   B:3; C:7; H:9; N:1; S:1; T:1; Y:1; AA:1
763   N:1; S:1; AA:5


7iC>l 


Xl.J 




D,Df KJ.ly W.l 


/oo 


rl.z 


/o / 




7^C 
/Oo 


no* P-A- H*0- ^-l- T-1- V*l- AA-1 




A-i* vi'A' C'A' v-A' r;-i=;- u-in* v-o- n-R- p-?- i^-i* ^-R- T-J- W-'^- aa*7- 

A. 1, JtJ.^, y^.^f \J»Oy n. lu, Jv.z, vJ.o, 1 Jtv.i, o-o, 1./., vv ,_), 

AC:1 


770   A:2; P:16; X:1
771   AA:3
772   O:4
773   B:1; C:1; W:1
774   P:2; X:4
775   B:18; C:6; H:5; K:3; O:7; S:10; W:1; Y:3; Z:2; AA:2; AC:1
776   H:7
777   B:26; C:8; H:5; K:4; O:10; S:17; W:1; Y:4; Z:4; AA:4; AC:2
778   B:6
779   B:3; C:J; G:1; H:2; K:1; Q:1; S:8; W:2; Y:9; AA:4
780   B:3; C:1; F:1; P:1; W:1; AC:1
781   I:2; N:1; P:1; R:3; AA:1
782   B:2






783   H:1; P:2; S:3; AD:1
784   H:1; P:1; S:4; AD:1
785   T:2
786   D:1; AC:9
787   H:1; L:1; S:1
788   B:6; S:4




S:1;T:1 


700 


B l- C*2' H*5* W l- AD'l 


701 




70? 


B-3- D'l* K-2' S-2- Y l 




fr'?' AA-1 

VJ.X, rV/^* i 




r-4- D l- F l- F-^- G-6- J l- K-6- N l* O l* ?2' R l* S-3* T'2" 
W:2; X:l; Y:l; Z:l; AA:1; ACi2; AD:1 


795   B:4; C:1; E:2; H:4; J:1; L:1; O:4; S:1; V:1; Y:3; Z:1
796   H:5
797   B:2; E:1; N:2
798   B:1; G:1; H:6; T:1; W:1
799   H:2


Qt\(\ 

oUU 


IT. 9. T.9. A A • 1 


801   A:2; B:4; C:14; D:1; H:2; K:1; N:2; S:4; T:1; W:2; AA:20
802   AA:17
803   B:2; G:3; H:3; S:1; U:1; AC:1; AD:2
804   C:1; S:2; T:2; X:2; AA:1; AC:1
805   B:5; C:6; D:5; H:17; J:2; K:4; N:1; O:6; P:2; S:5; T:5; W:1; X:1; Z:2;


806 


B:2; C:3; D:3; H:6; J:2; K:l; N:l; 0:3; P:l; S:2; T:4; W:l; X:l; Z:l; 

A A A 1 


oU/ 




QAQ 

oUo 




fine 


p. -J. W-4 
rj. J, YV .H 


oiU 




fii 1 

oil 


P-fi* P-S- n-l- H'l- l^-S- n-7- r)'7- R-2- S'2- T-3- Y-4' z-2- AA-1* AC*r 
AD:2 


1600  T:4


iOvl 


rVrV. J 




\_^, J, n. I 




Ar*2 


1 OUH 


R-7- P-l - F-1- H I* P-7* T^-'^* T'2- Z'3* AA*2 


1 AO^ 


p'4' H '^- n- 1 


1 AO/^ 
iOUO 


A '^» nw r i4' n-?- F IO- F *^- G19- h*32* K ll- 0-5' P*2* R-3* S'16* 
T:4; W:2; Y:10; Z:8; AA:1; AC:3 


1607  T:3
1608  B:3; P:2
1609  R:4
1610  B:4
1611  B:3; T:1
1612  T:2
1613  V:5
1614  D:3
1615  AA:10
1616  B:4
1617  T:2
1618  K:2; S:8; AA:1
1619  B:2
1620  W:2
1621  H:1; AB:1
1622  H:2



Table VI 



Tissue code  Tissue type
A     Bone Marrow
B     Brain
C     Cancerous prostate
D     Cerebellum
E     Colon
F     Dystrophic muscle
G     Fetal brain
H     Fetal kidney
I     Fetal liver
J     Heart
K     Hypertrophic prostate
L     Kidney
M     Large intestine
N     Liver
O     Lung
P     Lymph ganglia
Q     Lymphocytes
R     Muscle
S     Prostate
T     Ovary
U     Pancreas
V     Placenta
W     Spinal cord
X     Spleen
Y     Substantia nigra
Z     Surrenals
AA    Testis
AB    Thyroid
AC    Umbilical cord
AD    Uterus
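
By way of illustration only, an entry of the tissue distribution column of Table V may be decoded with the letter codes of Table VI using a short routine such as the following Python sketch. The names TISSUE_CODES and parse_distribution are hypothetical and were chosen for this illustration; the sketch assumes that each entry has the form "code:count" with entries separated by semicolons.

```python
# Letter codes and tissue types as listed in Table VI.
TISSUE_CODES = {
    "A": "Bone Marrow", "B": "Brain", "C": "Cancerous prostate",
    "D": "Cerebellum", "E": "Colon", "F": "Dystrophic muscle",
    "G": "Fetal brain", "H": "Fetal kidney", "I": "Fetal liver",
    "J": "Heart", "K": "Hypertrophic prostate", "L": "Kidney",
    "M": "Large intestine", "N": "Liver", "O": "Lung",
    "P": "Lymph ganglia", "Q": "Lymphocytes", "R": "Muscle",
    "S": "Prostate", "T": "Ovary", "U": "Pancreas", "V": "Placenta",
    "W": "Spinal cord", "X": "Spleen", "Y": "Substantia nigra",
    "Z": "Surrenals", "AA": "Testis", "AB": "Thyroid",
    "AC": "Umbilical cord", "AD": "Uterus",
}


def parse_distribution(entry):
    """Map a Table V entry such as 'P:7; T:2; Z:1' to {tissue type: 5' EST count}."""
    counts = {}
    for item in entry.split(";"):
        item = item.strip()
        if not item:
            continue  # tolerate a trailing semicolon
        code, _, number = item.partition(":")
        # Unknown codes are kept as-is rather than guessed.
        tissue = TISSUE_CODES.get(code.strip(), code.strip())
        counts[tissue] = counts.get(tissue, 0) + int(number)
    return counts


if __name__ == "__main__":
    print(parse_distribution("P:7; T:2; Z:1"))
```

For the entry shown, the routine reports seven 5' ESTs from lymph ganglia, two from ovary, and one from surrenals.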



In addition to categorizing the 5' ESTs and consensus contigated 5' ESTs with respect to their
tissue of origin, the spatial and temporal expression patterns of the mRNAs corresponding to the 5' ESTs
and consensus contigated 5' ESTs, as well as their expression levels, may be determined as described in
Example 18 below.
Characterization of the spatial and temporal expression patterns and expression levels of these
mRNAs is useful for constructing expression vectors capable of producing a desired level of gene 
product in a desired spatial or temporal manner, as will be discussed in more detail below. 

Furthermore, 5' ESTs and consensus contigated 5' ESTs whose corresponding mRNAs are
associated with disease states may also be identified. For example, a particular disease may result from
the lack of expression, over expression, or under expression of an mRNA corresponding to a 5' EST or
consensus contigated 5' EST. By comparing mRNA expression patterns and quantities in samples taken
from healthy individuals with those from individuals suffering from a particular disease, 5' ESTs or
consensus contigated 5' ESTs responsible for the disease may be identified.
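
The comparison between healthy and diseased samples described above may be sketched, purely as an illustration, as follows. The function name classify_expression and the two-fold threshold are assumptions made for this sketch and are not taken from the present specification; any suitable threshold may be substituted.

```python
def classify_expression(healthy, diseased, fold=2.0):
    """Compare mRNA quantities (same arbitrary units) from healthy and diseased samples.

    The `fold` threshold is a hypothetical cut-off chosen for illustration.
    """
    if diseased == 0:
        return "lack of expression" if healthy > 0 else "unchanged"
    if healthy == 0:
        return "over expression"
    ratio = diseased / healthy
    if ratio >= fold:
        return "over expression"
    if ratio <= 1.0 / fold:
        return "under expression"
    return "unchanged"


if __name__ == "__main__":
    for pair in [(100, 0), (100, 250), (100, 40), (100, 120)]:
        print(pair, classify_expression(*pair))
```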

It will be appreciated that the results of the above characterization procedures for 5' ESTs and
consensus contigated 5' ESTs also apply to extended cDNAs (obtainable as described below) which
contain sequences adjacent to the 5' ESTs and consensus contigated 5' ESTs. It will also be appreciated
that if desired, characterization may be delayed until extended cDNAs have been obtained rather than
characterizing the 5' ESTs or consensus contigated 5' ESTs themselves.

EXAMPLE 18 
Evaluation of Expression Levels and Patterns of mRNAs 
Corresponding to EST-Related Nucleic Acids 
Expression levels and patterns of mRNAs corresponding to EST-related nucleic acids may be
analyzed by solution hybridization with long probes as described in International Patent Application No.
WO 97/05277. Briefly, an EST-related nucleic acid, fragment of an EST-related nucleic acid, positional
segment of an EST-related nucleic acid, or fragment of a positional segment of an EST-related nucleic
acid corresponding to the gene encoding the mRNA to be characterized is inserted at a cloning site
immediately downstream of a bacteriophage (T3, T7 or SP6) RNA polymerase promoter to produce
antisense RNA. Preferably, the EST-related nucleic acid, fragment of an EST-related nucleic acid,
positional segment of an EST-related nucleic acid, or fragment of a positional segment of an EST-related
nucleic acid is 100 or more nucleotides in length. The plasmid is linearized and transcribed in the
presence of ribonucleotides comprising modified ribonucleotides (i.e., biotin-UTP and DIG-UTP). An
excess of this doubly labeled RNA is hybridized in solution with mRNA isolated from cells or tissues of
interest. The hybridizations are performed under standard stringent conditions (40-50°C for 16 hours in
an 80% formamide, 0.4 M NaCl buffer, pH 7-8). The unhybridized probe is removed by digestion with
ribonucleases specific for single-stranded RNA (i.e., RNases CL3, T1, Phy M, U2 or A). The presence
of the biotin-UTP modification enables capture of the hybrid on a microtitration plate coated with
streptavidin. The presence of the DIG modification enables the hybrid to be detected and quantified by
ELISA using an anti-DIG antibody coupled to alkaline phosphatase.

The EST-related nucleic acid, fragment of an EST-related nucleic acid, positional segment of an
EST-related nucleic acid, or fragment of a positional segment of an EST-related nucleic acid may also be
tagged with nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK
Patent Application No. 2 305 241 A. In this method, cDNAs are prepared from a cell, tissue, organism
or other source of nucleic acid for which gene expression patterns must be determined. The resulting
cDNAs are separated into two pools. The cDNAs in each pool are cleaved with a first restriction
endonuclease, called an anchoring enzyme, having a recognition site which is likely to be present at least
once in most cDNAs. The fragments which contain the 5' or 3' most region of the cleaved cDNA are
isolated by binding to a capture medium such as streptavidin coated beads. A first oligonucleotide linker
having a first sequence for hybridization of an amplification primer and an internal restriction site for a
so called tagging endonuclease is ligated to the digested cDNAs in the first pool. Digestion with the
second endonuclease produces short tag fragments from the cDNAs.

A second oligonucleotide having a second sequence for hybridization of an amplification primer
and an internal restriction site is ligated to the digested cDNAs in the second pool. The cDNA fragments
in the second pool are also digested with the tagging endonuclease to generate short tag fragments
derived from the cDNAs in the second pool. The tags resulting from digestion of the first and second
pools with the anchoring enzyme and the tagging endonuclease are ligated to one another to produce so
called ditags. In some embodiments, the ditags are concatamerized to produce ligation products
containing from 2 to 200 ditags. The tag sequences are then determined and compared to the sequences
of the EST-related nucleic acid, fragment of an EST-related nucleic acid, positional segment of an
EST-related nucleic acid, or fragment of a positional segment of an EST-related nucleic acid to determine
which 5' ESTs, consensus contigated 5' ESTs, or extended cDNAs are expressed in the cell, tissue,
organism, or other source of nucleic acids from which the tags were derived. In this way, the expression
pattern of the 5' ESTs, consensus contigated 5' ESTs, or extended cDNAs in the cell, tissue, organism,
or other source of nucleic acids is obtained.
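
The final comparison step of the SAGE procedure, in which the determined tag sequences are compared to the reference sequences, may be sketched as follows. This is a minimal illustration under stated assumptions: the tags are assumed to have already been extracted from the sequenced ditags and oriented on the same strand as the references, the function name tag_expression is hypothetical, and the short sequences in the usage example are invented placeholders rather than sequences of the invention.

```python
from collections import Counter


def tag_expression(tags, references):
    """Count, for each reference sequence, how many observed tags occur within it.

    tags       -- iterable of tag sequences read from the ditags
    references -- dict mapping a sequence name to its nucleotide sequence
    Returns a dict {name: tag count}; references matching no tag are omitted.
    """
    tag_counts = Counter(tag.upper() for tag in tags)
    expression = {}
    for name, sequence in references.items():
        sequence = sequence.upper()
        total = sum(n for tag, n in tag_counts.items() if tag in sequence)
        if total:
            expression[name] = total
    return expression


if __name__ == "__main__":
    # Placeholder data for illustration only.
    observed = ["ACGTACGTAC", "ACGTACGTAC", "TTTTGGGGCC"]
    refs = {"est_1": "GGACGTACGTACGG", "est_2": "AAAAAAAAAAAA"}
    print(tag_expression(observed, refs))
```

A count obtained in this way reflects only exact matches; handling of sequencing errors or of tags shared by several sequences is outside the scope of the sketch.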

Quantitative analysis of gene expression may also be performed using arrays. As used herein,
the term array means a one dimensional, two dimensional, or multidimensional arrangement of
EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic
acids, or fragments of positional segments of EST-related nucleic acids. Preferably, the EST-related
nucleic acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or
fragments of positional segments of EST-related nucleic acids are at least 10, 12, 15, 18, 20, 23, 25, 28,
30, 35, 40, or 50 nucleotides in length. More preferably, the EST-related nucleic acids, fragments of
EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments of positional
segments of EST-related nucleic acids are at least 100 nucleotides long. More preferably, the fragments
are more than 100 nucleotides in length. In some embodiments, the EST-related nucleic acids,
fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments of
positional segments of EST-related nucleic acids may be more than 500 nucleotides long.

For example, quantitative analysis of gene expression may be performed with EST-related
nucleic acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or
fragments of positional segments of EST-related nucleic acids in a complementary DNA microarray as
described by Schena et al. (Science 270:467-470, 1995; Proc. Natl. Acad. Sci. U.S.A. 93:10614-10619,
1996). EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of
EST-related nucleic acids, or fragments of positional segments of EST-related nucleic acids are amplified by
PCR and arrayed from 96-well microtiter plates onto silylated microscope slides using high-speed
robotics. Printed arrays are incubated in a humid chamber to allow rehydration of the array elements and
rinsed, once in 0.2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride
solution. The arrays are submerged in water for 2 min at 95°C, transferred into 0.2% SDS for 1 min,
rinsed twice with water, air dried and stored in the dark at 25°C.

Cell or tissue mRNA is isolated or commercially obtained and probes are prepared by a single
round of reverse transcription. Probes are hybridized to 1 cm² microarrays under a 14 x 14 mm glass
coverslip for 6-12 hours at 60°C. Arrays are washed for 5 min at 25°C in low stringency wash buffer (1
x SSC/0.2% SDS), then for 10 min at room temperature in high stringency wash buffer (0.1 x SSC/0.2%
SDS). Arrays are scanned in 0.1 x SSC using a fluorescence laser scanning device fitted with a custom
filter set. Accurate differential expression measurements are obtained by taking the average of the ratios
of two independent hybridizations.
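
The averaging of ratios from two independent hybridizations mentioned above may be illustrated by the following sketch. The function name mean_expression_ratio and the representation of each hybridization as a (test signal, reference signal) pair for a single array element are assumptions made for this illustration; background subtraction and normalization are deliberately omitted.

```python
def mean_expression_ratio(hybridizations):
    """Average the test/reference signal ratios of independent hybridizations.

    hybridizations -- list of (test_signal, reference_signal) pairs measured
                      for the same array element, one pair per hybridization.
    """
    if not hybridizations:
        raise ValueError("at least one hybridization is required")
    ratios = [test / reference for test, reference in hybridizations]
    return sum(ratios) / len(ratios)


if __name__ == "__main__":
    # Two independent hybridizations of one array element (arbitrary units).
    print(mean_expression_ratio([(200.0, 100.0), (300.0, 100.0)]))
```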

Quantitative analysis of the expression of genes may also be performed with EST-related
nucleic acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or
fragments of positional segments of EST-related nucleic acids in complementary DNA arrays as
described by Pietu et al. (Genome Research 6:492-503, 1996). The EST-related nucleic acids, fragments
of EST-related nucleic acids, positional segments of EST-related nucleic acids, or fragments of positional
segments of EST-related nucleic acids are PCR amplified and spotted on membranes. Then,
mRNAs originating from various tissues or cells are labeled with radioactive nucleotides. After
hybridization and washing in controlled conditions, the hybridized mRNAs are detected by
phospho-imaging or autoradiography. Duplicate experiments are performed and a quantitative analysis of
differentially expressed mRNAs is then performed.

Alternatively, expression analysis of the EST-related nucleic acids, fragments of EST-related
nucleic acids, positional segments of EST-related nucleic acids, or fragments of positional segments of
EST-related nucleic acids can be done through high density nucleotide arrays as described by Lockhart
et al. (Nature Biotechnology 14:1675-1680, 1996) and Sosnowsky et al. (Proc. Natl. Acad. Sci.
94:1119-1123, 1997). Oligonucleotides of 15-50 nucleotides corresponding to sequences of EST-related
nucleic acids, fragments of EST-related nucleic acids, positional segments of EST-related nucleic acids, or
fragments of positional segments of EST-related nucleic acids are synthesized directly on the chip
(Lockhart et al., supra) or synthesized and then addressed to the chip (Sosnowsky et al., supra).
Preferably, the oligonucleotides are about 20 to 25 nucleotides in length.

cDNA probes labeled with an appropriate compound, such as biotin, digoxigenin or fluorescent
dye, are synthesized from the appropriate mRNA population and then randomly fragmented to an
average size of 50 to 100 nucleotides. The probes are then hybridized to the chip. After washing as
described in Lockhart et al., supra, and application of different electric fields (Sosnowsky et al., supra), the
dyes or labeling compounds are detected and quantified. Duplicate hybridizations are performed.
Comparative analysis of the intensity of the signal originating from cDNA probes on the same target
oligonucleotide in different cDNA samples indicates differential expression of the mRNA
corresponding to the 5' EST, consensus contigated 5' EST or extended cDNA from which the
oligonucleotide sequence has been designed.

IV. Use of 5' ESTs to Clone Extended cDNAs and to Clone the Corresponding Genomic DNAs 

Once 5' ESTs or consensus contigated 5' ESTs which include the 5' end of the corresponding
mRNAs have been selected using the procedures described above, they can be utilized to isolate
extended cDNAs which contain sequences adjacent to the 5' ESTs or consensus contigated 5' ESTs.
The extended cDNAs may include the entire coding sequence of the protein encoded by the
corresponding mRNA, including the authentic translation start site. If the extended cDNA encodes a
secreted protein, it may contain the signal sequence, and the sequence encoding the mature protein
remaining after cleavage of the signal peptide.

Extended cDNAs which include the entire coding sequence of the protein encoded by the
corresponding mRNA are referred to herein as "full-length cDNAs." Alternatively, the extended cDNAs
may not include the entire coding sequence of the protein encoded by the corresponding mRNA,
although they do include sequences adjacent to the 5' ESTs or consensus contigated 5' ESTs. In some
embodiments in which the extended cDNAs are derived from an mRNA encoding a secreted protein, the
extended cDNAs may include only the sequence encoding the mature protein remaining after cleavage
of the signal peptide, or only the sequence encoding the signal peptide.

Examples 19 and 20 below describe a general method for obtaining extended cDNAs using 5'
ESTs or consensus contigated 5' ESTs and nucleic acid homologous thereto. Example 21 below
describes the cloning and sequencing of several extended cDNAs, including full-length cDNAs which
include the authentic 5' end of the corresponding mRNA for several secreted proteins.

The methods of Examples 19 and 20 can also be used to obtain extended cDNAs which encode
less than the entire coding sequence of proteins encoded by the genes corresponding to the 5' ESTs or
consensus contigated 5' ESTs. In some embodiments, the extended cDNAs isolated using these methods
encode at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of one of the
proteins encoded by the sequences of SEQ ID NOs. 24-811 and 1600-1622. In some embodiments, the
extended cDNAs isolated using these methods encode at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100,
or 150 consecutive amino acids of one of the proteins encoded by the sequences of SEQ ID NOs. 24-811.



EXAMPLE 19 
General Method for Using 5' ESTs or Consensus Contigated 5' ESTs to Clone and Sequence Extended
cDNAs which Include the Entire Coding Region and the Authentic 5' End of the Corresponding mRNA

The following general method may be used to quickly and efficiently isolate extended cDNAs
including sequence adjacent to the sequences of the 5' ESTs or consensus contigated 5' ESTs used to
obtain them. This method may be applied to obtain extended cDNAs for any 5' EST or consensus
contigated 5' EST of the invention, including those 5' ESTs and consensus contigated 5' ESTs encoding
secreted proteins. This method is illustrated in Figure 3.

1. Obtaining Extended cDNAs

The method takes advantage of the known 5' sequence of the mRNA. A reverse transcription
reaction is conducted on purified mRNA with a poly dT primer containing a nucleotide sequence at its 5'
end allowing the addition of a known sequence at the end of the cDNA which corresponds to the 3' end
of the mRNA. Such a primer and a commercially-available reverse transcriptase enzyme are added to a
buffered mRNA sample yielding a reverse transcript anchored at the 3' polyA site of the RNAs.
Nucleotide monomers are then added to complete the first strand synthesis.

After removal of the mRNA hybridized to the first cDNA strand by alkaline hydrolysis, the
products of the alkaline hydrolysis and the residual poly dT primer can be eliminated with an exclusion
column.

Subsequently, a pair of nested primers on each end is designed based on the known 5' sequence
from the 5' EST or consensus contigated 5' EST and the known 3' end added by the poly dT primer
used in the first strand synthesis. Software used to design primers is either based on GC content and
melting temperatures of oligonucleotides, such as OSP (Hillier and Green, PCR Meth. Appl. 1:124-128,
1991), or based on the octamer frequency disparity method (Griffais et al., Nucleic Acids Res. 19:3887-
3891, 1991), such as PC-Rare
(http://bioinformatics.weizmann.ac.il/software/PC-Rare/doc/manuel.html). Preferably, the nested
primers at the 5' end and the nested primers at the 3' end are separated from one another by four to
nine bases. These primer sequences may be selected to have melting temperatures and specificities
suitable for use in PCR.

A first PCR run is performed using the outer primer from each of the nested pairs. A second 
PCR run using the inner primer from each of the nested pairs is then performed on a small sample of the 
first PCR product. Thereafter, the primers and remaining nucleotide monomers are removed. 

2. Sequencing Extended cDNAs or Fragments Thereof

Due to the lack of position constraints on the design of 5' nested primers compatible for PCR
use using the OSP software, amplicons of two types are obtained. Preferably, the second 5' primer is
located upstream of the translation initiation codon thus yielding a nested PCR product containing the
entire coding sequence. Such an extended cDNA may be used in a direct cloning procedure as described
in section a below. However, in some cases, the second 5' primer is located downstream of the
translation initiation codon, thereby yielding a PCR product containing only part of the ORF. Such
incomplete PCR products are submitted to a modified procedure described in section b below.




a) Nested PCR products containing complete ORFs 

When the resulting nested PCR product contains the complete coding sequence, as predicted
from the 5'EST or consensus contigated 5' EST sequence, it is directly cloned in an appropriate vector
as described in section 3.

b) Nested PCR products containing incomplete ORFs

When the amplicon does not contain the complete coding sequence, intermediate steps are 
necessary to obtain both the complete coding sequence and a PCR product containing the full coding 
sequence. The complete coding sequence can be assembled from several partial sequences 
determined directly from different PCR products. 
Once the full coding sequence has been completely determined, new primers compatible for
PCR use are then designed to obtain amplicons containing the whole coding region. However, in
such cases, 3' primers compatible for PCR use are located inside the 3' UTR of the corresponding
mRNA, thus yielding amplicons which lack part of this region, i.e. the polyA tract and sometimes the
polyadenylation signal, as illustrated in Figure 3. Such extended cDNAs are then cloned into an
appropriate vector as described in section 3.

c) Sequencing extended cDNAs

Sequencing of extended cDNAs can be performed using a Dye Terminator approach with the
AmpliTaq DNA polymerase FS kit available from Perkin Elmer.

In order to sequence long PCR fragments, primer walking is performed using software such as
OSP to choose primers and automated computer software such as ASMG (Sutton et al., Genome Science
Technol. 1:9-19, 1995) to construct contigs of walking sequences including the initial 5' tag. Preferably,
primer walking is performed until the sequences of full length cDNAs are obtained.

Completion of the sequencing of a given extended cDNA fragment may be assessed by
comparing the sequence length to the size of the corresponding nested PCR product. When Northern
blot data are available, the size of the mRNA detected for a given PCR product may also be used to
finally assess that the sequence is complete. Sequences which do not fulfill these criteria are discarded
and will undergo a new isolation procedure.

3. Cloning Extended cDNAs

The PCR product containing the full coding sequence is then cloned in an appropriate vector.
For example, the extended cDNAs can be cloned into any expression vector known in the art, such as
pED6dpc2 (DiscoverEase, Genetics Institute, Cambridge, MA).

Cloned PCR products are then entirely sequenced in order to obtain at least two sequences
per clone. Preferably, the sequences are obtained from both sense and antisense strands according to
the aforementioned procedure with the following modifications. First, both 5' and 3' ends of cloned
PCR products are sequenced in order to confirm the identity of the clone. Second, primer walking is
performed if the full coding region has not been obtained yet. Contigation is then performed
using primer walking sequences for cloned products as well as walking sequences that have already
been contigated for uncloned PCR products. The sequence is considered complete when the resulting
contigs include the whole coding region as well as overlapping sequences with vector DNA on both
ends. All the contigated sequences for each cloned amplicon are then used to obtain a consensus
sequence.

4. Selection of Cloned Full length Sequences

a) Computer analysis of extended cDNAs 

Following identification of contaminants and masking of repeats, structural features, e.g.
polyA tail and polyadenylation signal, of the sequences of extended cDNAs are subsequently
determined using methods known to those skilled in the art. For example, the algorithm, parameters and
criteria defined in Figure 10 may be used. Briefly, a polyA tail is defined as a homopolymeric stretch
of at least 11 A with at most one alternative base within it. The polyA tail search is restricted to the
last 20 nucleotides of the sequence and limited to stretches of 11 consecutive A's because sequencing
reactions are often not readable after such a polyA stretch. To search for a polyadenylation signal,
the polyA tail is clipped from the full-length sequence. The 50 nucleotides preceding the polyA tail
are searched for the canonical polyadenylation AAUAAA signal allowing one mismatch to account for
possible sequencing errors as well as known variation in the canonical sequence of the
polyadenylation signal.
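
The tail and signal criteria described above can be sketched in Python. This is only an illustration under stated assumptions, not the algorithm of Figure 10: the function names are hypothetical, the signal is searched as AATAAA because a cDNA sequence is written in DNA letters, and the single non-A base is assumed to be allowed only inside the stretch, not at either end.

```python
def find_polya_start(seq, min_len=11, window=20):
    """Return the index where a polyA tail begins, or None if none is found.

    Only the last `window` nucleotides are examined. A tail is a stretch of
    `min_len` bases that starts and ends with A and contains at most one
    base other than A (assumption: the odd base must be internal).
    """
    seq = seq.upper()
    offset = max(0, len(seq) - window)
    tail_region = seq[offset:]
    for start in range(len(tail_region) - min_len + 1):
        stretch = tail_region[start:start + min_len]
        if (stretch[0] == "A" and stretch[-1] == "A"
                and stretch.count("A") >= min_len - 1):
            return offset + start
    return None


def find_polya_signal(seq, tail_start, signal="AATAAA", upstream=50, max_mismatch=1):
    """Return the index of the first hexamer, within `upstream` nucleotides
    before the clipped tail, that differs from `signal` at no more than
    `max_mismatch` positions; return None if there is no such hexamer."""
    seq = seq.upper()
    region_start = max(0, tail_start - upstream)
    for i in range(region_start, tail_start - len(signal) + 1):
        hexamer = seq[i:i + len(signal)]
        mismatches = sum(a != b for a, b in zip(hexamer, signal))
        if mismatches <= max_mismatch:
            return i
    return None


# Hypothetical test sequence: filler, a signal, a spacer, then 15 A's.
example = "GCGC" * 10 + "AATAAA" + "GCGCGCGCGCGC" + "A" * 15
tail_start = find_polya_start(example)
signal_pos = find_polya_signal(example, tail_start)
```

In this sketch the tail is clipped implicitly: the signal search never reads past `tail_start`, so the A-rich tail cannot itself be mistaken for a signal.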

Functional features, e.g. ORFs and signal sequences, of the sequences of extended cDNAs are
subsequently determined as follows. The 3 upper strand frames of extended cDNAs are searched for
ORFs defined as the maximum length fragments beginning with a translation initiation codon and
ending with a stop codon. ORFs encoding at least 80 amino acids are preferred. If extended cDNAs
encoding secreted proteins are desired, each found ORF is then scanned for the presence of a signal
peptide using the matrix method described in Example 13.

Sequences of extended cDNAs are then compared, on a nucleotidic or proteic basis, to public
sequences available at the time of filing.
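
The ORF search described above can be sketched as follows. This is a minimal illustration, assuming ATG as the only initiation codon and the three standard stop codons; the function name and the tuple layout of the result are hypothetical, and the 80 amino acid threshold is exposed as a parameter because the text states it as a preference.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_aa=80):
    """Return (start, end, frame) tuples for ORFs in the three upper strand frames.

    In each frame an ORF runs from the first ATG after the previous in-frame
    stop codon to the next in-frame stop codon, which makes it the maximum
    length fragment ending at that stop. `end` is the index just past the
    stop codon; `min_aa` counts codons before the stop, initiator included.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_aa:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs


# Hypothetical sequence: an ATG, 80 alanine codons and a stop, in frame 2.
example = "CC" + "ATG" + "GCT" * 80 + "TAA" + "GG"
example_orfs = find_orfs(example)
```

An ORF that reaches the end of the sequence without a stop codon is not reported here, since the text defines an ORF as ending with a stop codon.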

b) Selection of full-length cDNAs of interest 

A negative selection may then be performed in order to eliminate unwanted cloned sequences
resulting from either contaminants or PCR artifacts as follows. Sequences matching contaminant
sequences such as vector DNA, tRNA, mtRNA, rRNA sequences are discarded as well as those
encoding ORF sequences exhibiting extensive homology to repeats. Sequences obtained by direct
cloning (section 1a) but lacking polyA tail may be discarded. Only ORFs ending either before the polyA
tail (section 1a) or before the end of the cloned 3'UTR (section 1b) may be selected. If extended cDNAs
encoding secreted proteins are desired, ORFs containing a signal peptide are considered. In addition,
ORFs containing unlikely mature proteins, such as mature proteins whose size is less than 20 amino acids
or less than 25% of the immature protein size, may be eliminated.
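
The size criterion for unlikely mature proteins can be written as a short check. The sketch below assumes that the mature protein is the precursor minus its signal peptide; the function and parameter names are hypothetical.

```python
def is_unlikely_mature_protein(precursor_len, signal_peptide_len,
                               min_len=20, min_fraction=0.25):
    """Return True when the mature protein is shorter than `min_len` amino
    acids or shorter than `min_fraction` of the immature (precursor) protein."""
    mature_len = precursor_len - signal_peptide_len
    return mature_len < min_len or mature_len < min_fraction * precursor_len
```

For instance, a 100 residue precursor with an 80 residue predicted signal peptide leaves a 20 residue mature protein, which passes the absolute threshold but fails the 25% threshold.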

Then, for each remaining full length cDNA containing several ORFs, a preselection of ORFs
may be performed using the following criteria. The longest ORF is preferred. If extended cDNAs
encoding secreted proteins are desired and if the ORF sizes are similar, the chosen ORF is the one whose
signal peptide has the highest score according to the Von Heijne method.

Sequences of full length cDNA clones may then be compared pairwise after masking of the
repeat sequences. Full-length cDNA sequences exhibiting extensive homology may be clustered in the
same class. Each cluster may then be subjected to a cluster analysis that detects sequences resulting
from internal priming or from alternative splicing, identical sequences or sequences with several
frameshifts. A selection may be made among clones belonging to the same class in order to detect
clones encoding homologous but distinct ORFs, both of which may be selected if they both contain
sequences of interest.

Selection of full-length cDNA clones encoding sequences of interest may subsequently be
performed using the following criteria. Structural parameters (initial tag, polyadenylation site and
signal) are first checked. Then, homologies with known nucleic acids and proteins are examined in
order to determine whether the clone sequence matches a known nucleotide/protein sequence and, in the
latter case, its covering rate and the date at which the sequence became public. If there is no extensive
match with sequences other than ESTs or genomic DNA, or if the clone sequence brings substantial new
information, such as encoding a protein resulting from alternative splicing of an mRNA coding for an
already known protein, the sequence is kept. Examples of such cloned full-length cDNAs containing
sequences of interest are described in Example 21. Sequences resulting from chimera or double inserts
or located on chromosome breaking points as assessed by homology to other sequences may be
discarded during this procedure.

Extended cDNAs prepared as described above may be subsequently engineered to obtain
nucleic acids which include desired portions of the extended cDNA using conventional techniques such
as subcloning, PCR, or in vitro oligonucleotide synthesis. For example, nucleic acids which include only
the full coding sequences may be obtained using techniques known to those skilled in the art.
Alternatively, conventional techniques may be applied to obtain nucleic acids which contain only part of
the coding sequences. In the case of nucleic acids encoding secreted proteins, nucleic acids containing
only the coding sequence for the mature protein remaining after the signal peptide is cleaved off or
nucleic acids which contain only the coding sequences for the signal peptides may be obtained.

Similarly, nucleic acids containing any other desired portion of the coding sequences for the
encoded protein may be obtained. For example, the nucleic acid may contain at least 10, 15, 18, 20, 25,
28, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400 or 500 consecutive bases of an extended cDNA.

Once an extended cDNA has been obtained, it can be sequenced to determine the amino acid
sequence it encodes. Once the encoded amino acid sequence has been determined, one can create and
identify any of the many conceivable cDNAs that will encode that protein by simply using the
degeneracy of the genetic code. For example, allelic variants or other homologous nucleic acids can be
identified as described below. Alternatively, nucleic acids encoding the desired amino acid sequence can
be synthesized in vitro.
In a preferred embodiment, the coding sequence may be selected using the known codon or
codon pair preferences for the host organism in which the cDNA is to be expressed.
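
To give a sense of how many coding sequences the degeneracy of the genetic code allows for one protein, the following sketch multiplies the number of synonymous codons available at each residue. It assumes the standard genetic code and one-letter amino acid symbols; the function name is hypothetical, and choosing among the alternatives by host codon preference is outside the scope of this illustration.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    **dict.fromkeys("LSR", 6),
    **dict.fromkeys("AGPTV", 4),
    "I": 3,
    **dict.fromkeys("CDEFHKNQY", 2),
    **dict.fromkeys("MW", 1),
}


def count_coding_sequences(protein):
    """Return how many distinct nucleotide sequences encode `protein`
    (stop codon not counted)."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in protein.upper())
```

Even a three residue peptide such as Met-Leu-Ser has 1 x 6 x 6 = 36 possible coding sequences, so the count grows very quickly with protein length.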

In addition to PCR based methods for obtaining cDNAs which include the authentic 5' end of
the corresponding mRNA as well as the complete protein coding sequence of the corresponding mRNA,
traditional hybridization based methods may also be employed. These methods may also be used to
obtain the genomic DNAs which encode the mRNAs from which the 5' ESTs or consensus contigated 5'
ESTs were derived, mRNAs corresponding to the extended cDNAs, or nucleic acids which are
homologous to extended cDNAs, 5' ESTs, or consensus contigated 5' ESTs. Example 20 below
provides examples of such methods.


EXAMPLE 20 

Methods for Obtaining Extended cDNAs which Include the Entire Coding Region and the Authentic
5' End of the Corresponding mRNA or Nucleic Acids Homologous to Extended cDNAs, 5' ESTs or
Consensus Contigated 5' ESTs

A full-length cDNA library can be made using the strategies described in Example 7.
Alternatively, a cDNA library or genomic DNA library may be obtained from a commercial source or
made using techniques familiar to those skilled in the art.

Such cDNA or genomic DNA libraries may be used to isolate extended cDNAs obtained from
5' ESTs or consensus contigated 5' ESTs or nucleic acids homologous to extended cDNAs, 5' ESTs, or
consensus contigated 5' ESTs as follows. The cDNA library or genomic DNA library is hybridized to a
detectable probe. The detectable probe may comprise at least 10, 15, 18, 20, 25, 28, 30, 35, 40, 50, 75,
100, 150, 200, 300, 400 or 500 consecutive nucleotides of the 5' EST, consensus contigated 5' EST, or
extended cDNA.

Techniques for identifying cDNA clones in a cDNA library which hybridize to a given probe
sequence are disclosed in Sambrook et al., Molecular Cloning: A Laboratory Manual 2d Ed., Cold
Spring Harbor Laboratory Press, 1989. The same techniques may be used to isolate genomic DNAs.
Briefly, cDNA or genomic DNA clones which hybridize to the detectable probe are identified and
isolated for further manipulation as follows. The detectable probe described in the preceding paragraph
is labeled with a detectable label such as a radioisotope or a fluorescent molecule. Techniques for
labeling the probe are well known and include phosphorylation with polynucleotide kinase, nick
translation, in vitro transcription, and non radioactive techniques. The cDNAs or genomic DNAs in the
library are transferred to a nitrocellulose or nylon filter and denatured. After blocking of non specific
sites, the filter is incubated with the labeled probe for an amount of time sufficient to allow binding of
the probe to cDNAs or genomic DNAs containing a sequence capable of hybridizing thereto.

By varying the stringency of the hybridization conditions used to identify cDNAs or genomic
DNAs which hybridize to the detectable probe, cDNAs or genomic DNAs having different levels of
homology to the probe can be identified and isolated as described below.

1. Identification of cDNA or Genomic DNA Sequences Having a High Degree of Homology to the
Labeled Probe

To identify cDNAs or genomic DNAs having a high degree of homology to the probe sequence,
the melting temperature of the probe may be calculated using the following formulas:

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is calculated
using the formula: Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction G+C) - (600/N), where N is the length
of the probe.

If the hybridization is carried out in a solution containing formamide, the melting temperature
may be calculated using the equation Tm = 81.5 + 16.6(log [Na+]) + 0.41(fraction G+C) - (0.63%
formamide) - (600/N), where N is the length of the probe.
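
The two melting temperature formulas can be combined into one function, since the first is the second with no formamide. This is a sketch under stated assumptions: the logarithm is taken as base 10, [Na+] is given in mol/L, and the G+C term is supplied as a percentage (0 to 100), which is how this formula is commonly applied; the text itself says "fraction G+C", so that reading is an assumption. The function and parameter names are hypothetical.

```python
import math


def melting_temperature(na_molar, gc_percent, length, formamide_percent=0.0):
    """Estimate the probe Tm in degrees Celsius.

    na_molar          -- sodium ion concentration in mol/L (assumed unit)
    gc_percent        -- G+C content as a percentage (assumed reading)
    length            -- probe length N in nucleotides
    formamide_percent -- formamide concentration in percent; 0 for none
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length)


# Hypothetical 60 nucleotide probe, 50% G+C, 1 M sodium.
tm_aqueous = melting_temperature(1.0, 50.0, 60)
tm_formamide = melting_temperature(1.0, 50.0, 60, formamide_percent=50.0)
```

With these illustrative inputs the estimate is about 92°C without formamide and about 60.5°C in 50% formamide, a difference of 0.63 x 50 = 31.5°C.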

Prehybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 µg
denatured fragmented salmon sperm DNA or 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 µg
denatured fragmented salmon sperm DNA, 50% formamide. The formulas for SSC and Denhardt's
solutions are listed in Sambrook et al., supra.

Hybridization is conducted by adding the detectable probe to the prehybridization solutions
listed above. Where the probe comprises double stranded DNA, it is denatured before addition to the
hybridization solution. The filter is contacted with the hybridization solution for a sufficient period of
time to allow the probe to hybridize to extended cDNAs or genomic DNAs containing sequences
complementary thereto or homologous thereto. For probes over 200 nucleotides in length, the
hybridization may be carried out at 15-25°C below the Tm. For shorter probes, such as oligonucleotide
probes, the hybridization may be conducted at 15-25°C below the Tm. Preferably, for hybridizations in
6X SSC, the hybridization is conducted at approximately 68°C. Preferably, for hybridizations in 50%
formamide containing solutions, the hybridization is conducted at approximately 42°C.

All of the foregoing hybridizations would be considered to be under "stringent" conditions.

Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room temperature for 15
minutes. The filter is then washed with 0.1X SSC, 0.5% SDS at room temperature for 30 minutes to 1
hour. Thereafter, the filter is washed at the hybridization temperature in 0.1X SSC, 0.5% SDS. A
final wash is conducted in 0.1X SSC at room temperature.

cDNAs or genomic DNAs which have hybridized to the probe are identified by autoradiography
or other conventional techniques.

2. Obtaining cDNA or Genomic DNA Sequences Having Lower Degrees of Homology to the
Labeled Probe

The above procedure may be modified to identify cDNAs or genomic DNAs having decreasing
levels of homology to the probe sequence. For example, to obtain cDNAs or genomic DNAs of
decreasing homology to the detectable probe, less stringent conditions may be used. For example, the
hybridization temperature may be decreased in increments of 5°C from 68°C to 42°C in a hybridization
buffer having a sodium concentration of approximately 1M. Following hybridization, the filter may be
washed with 2X SSC, 0.5% SDS at the temperature of hybridization. These conditions are considered to
be "moderate" conditions above 50°C and "low" conditions below 50°C.

Alternatively, the hybridization may be carried out in buffers, such as 6X SSC, containing
formamide at a temperature of 42°C. In this case, the concentration of formamide in the hybridization
buffer may be reduced in 5% increments from 50% to 0% to identify clones having decreasing levels of
homology to the probe. Following hybridization, the filter may be washed with 6X SSC, 0.5% SDS at
50°C. These conditions are considered to be "moderate" conditions above 25% formamide and "low"
conditions below 25% formamide. cDNAs or genomic DNAs which have hybridized to the probe are
identified by autoradiography.
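
The two reduced-stringency series, by temperature and by formamide concentration, can be enumerated as below. This sketch rests on assumptions: the starting points (68°C and 50% formamide) are left out because they correspond to the stringent conditions of section 1, the values exactly at a threshold are left unclassified because the text only labels conditions above and below it, and all names are hypothetical.

```python
def classify(value, threshold):
    """Label a condition relative to the threshold quoted in the text."""
    if value > threshold:
        return "moderate"
    if value < threshold:
        return "low"
    return "unclassified"  # the text does not label the threshold value itself


# Temperature series: 5 degree steps below 68 degrees, stopping before 42.
temperature_steps = [(t, classify(t, 50)) for t in range(63, 41, -5)]

# Formamide series at 42 degrees: 5% steps below 50%, down to 0%.
formamide_steps = [(f, classify(f, 25)) for f in range(45, -1, -5)]
```

Note that 5°C steps from 68°C give 63, 58, 53, 48 and 43°C, so the series approaches but does not land exactly on the 42°C end point.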

3. Determination of the Degree of Homology between the Obtained cDNAs or Genomic DNAs and
5'ESTs, Consensus Contigated 5'ESTs, or Extended cDNAs or Between the Polypeptides Encoded
by the Obtained cDNAs or Genomic DNAs and the Polypeptides Encoded by the 5'ESTs, Consensus
Contigated 5'ESTs, or Extended cDNAs

To determine the level of homology between the hybridized cDNA or genomic DNA and the
5'EST, consensus contigated 5'EST or extended cDNA from which the probe was derived, the
nucleotide sequences of the hybridized nucleic acid and the 5'EST, consensus contigated 5'EST or
extended cDNA from which the probe was derived are compared. The sequences of the 5'EST,
consensus contigated 5'EST or extended cDNA from which the probe was derived and the sequences of
the cDNA or genomic DNA which hybridized to the detectable probe may be stored on a computer
readable medium as described below and compared to one another using any of a variety of algorithms
familiar to those skilled in the art, including those described below.

To determine the level of homology between the polypeptide encoded by the hybridizing cDNA
or genomic DNA and the polypeptide encoded by the 5'EST, consensus contigated 5'EST or extended
cDNA from which the probe was derived, the polypeptide sequence encoded by the hybridized nucleic
acid and the polypeptide sequence encoded by the 5'EST, consensus contigated 5'EST or extended
cDNA from which the probe was derived are compared. The sequences of the polypeptide encoded by
the 5'EST, consensus contigated 5'EST or extended cDNA from which the probe was derived and the
polypeptide sequence encoded by the cDNA or genomic DNA which hybridized to the detectable probe
may be stored on a computer readable medium as described below and compared to one another using
any of a variety of algorithms familiar to those skilled in the art, including those described below.

Protein and/or nucleic acid sequence homologies may be evaluated using any of the variety
of sequence comparison algorithms and programs known in the art. Such algorithms and programs
include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW
(Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8):2444-2448; Altschul et al., 1990, J.
Mol. Biol. 215(3):403-410; Thompson et al., 1994, Nucleic Acids Res. 22(2):4673-4680; Higgins et
al., 1996, Methods Enzymol. 266:383-402; Altschul et al., 1990, J. Mol. Biol. 215(3):403-410;
Altschul et al., 1993, Nature Genetics 3:266-272).

In a particularly preferred embodiment, protein and nucleic acid sequence homologies are
evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the art
(see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267-2268; Altschul et al., 1990,
J. Mol. Biol. 215:403-410; Altschul et al., 1993, Nature Genetics 3:266-272; Altschul et al., 1997,
Nuc. Acids Res. 25:3389-3402). In particular, five specific BLAST programs are used to perform the
following tasks:



(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein
    sequence database;
(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence
    database;
(3) BLASTX compares the six-frame conceptual translation products of a query
    nucleotide sequence (both strands) against a protein sequence database;
(4) TBLASTN compares a query protein sequence against a nucleotide sequence
    database translated in all six reading frames (both strands); and
(5) TBLASTX compares the six-frame translations of a nucleotide query sequence
    against the six-frame translations of a nucleotide sequence database.
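
The correspondence between the kind of query, the kind of database and the program named in the list above can be captured in a small lookup. This sketch only encodes the list; it does not run BLAST, the function name and the `translate_both` flag are hypothetical, and BLAST3 is omitted because the list gives it the same role as BLASTP.

```python
# Keys: (query type, database type, translate both sides in six frames).
BLAST_PROGRAMS = {
    ("protein", "protein", False): "BLASTP",
    ("nucleotide", "nucleotide", False): "BLASTN",
    ("nucleotide", "protein", False): "BLASTX",
    ("protein", "nucleotide", False): "TBLASTN",
    ("nucleotide", "nucleotide", True): "TBLASTX",
}


def choose_blast_program(query_type, database_type, translate_both=False):
    """Return the program name for the given query and database types."""
    key = (query_type, database_type, translate_both)
    if key not in BLAST_PROGRAMS:
        raise ValueError(f"no BLAST program listed for {key}")
    return BLAST_PROGRAMS[key]
```

The extra flag is needed because a nucleotide query against a nucleotide database maps to either BLASTN or TBLASTX, depending on whether the comparison is made at the nucleotide level or on six-frame translations.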



The BLAST programs identify homologous sequences by identifying similar segments,
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid
sequence and a test sequence which is preferably obtained from a protein or nucleic acid sequence
database. High-scoring segment pairs are preferably identified (i.e., aligned) by means of a scoring
matrix, many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62
matrix (Gonnet et al., 1992, Science 256:1443-1445; Henikoff and Henikoff, 1993, Proteins 17:49-
61). Less preferably, the PAM or PAM250 matrices may also be used (see, e.g., Schwartz and
Dayhoff, eds., 1978, Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and
Structure, Washington: National Biomedical Research Foundation).

The BLAST programs evaluate the statistical significance of all high-scoring segment pairs
identified, and preferably select those segments which satisfy a user-specified threshold of
significance, such as a user-specified percent homology. Preferably, the statistical significance of a
high-scoring segment pair is evaluated using the statistical significance formula of Karlin (see, e.g.,
Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267-2268).

The parameters used with the above algorithms may be adapted depending on the sequence 
length and degree of homology studied. In some embodiments, the parameters may be the default 
parameters used by the algorithms in the absence of instructions from the user. 

In some embodiments, the level of homology between the hybridized nucleic acid and the
extended cDNA, 5'EST, or consensus contigated 5'EST from which the probe was derived may be
determined using the FASTDB algorithm described in Brutlag et al., Comp. App. Biosci. 6:237-245,
1990. In such analyses the parameters may be selected as follows: Matrix=Unitary, k-tuple=4,
Mismatch Penalty=1, Joining Penalty=30, Randomization Group Length=0, Cutoff Score=1, Gap
Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the sequence which hybridizes to
the probe, whichever is shorter. Because the FASTDB program does not consider 5' or 3' truncations
when calculating homology levels, if the sequence which hybridizes to the probe is truncated relative to
the sequence of the extended cDNA, 5'EST, or consensus contigated 5'EST from which the probe was
derived, the homology level is manually adjusted by calculating the number of nucleotides of the
extended cDNA, 5'EST, or consensus contigated 5' EST which are not matched or aligned with the
hybridizing sequence, determining the percentage of total nucleotides of the hybridizing sequence which
the non-matched or non-aligned nucleotides represent, and subtracting this percentage from the
homology level. For example, if the hybridizing sequence is 700 nucleotides in length and the extended
cDNA, 5'EST, or consensus contigated 5' EST sequence is 1000 nucleotides in length wherein the first
300 bases at the 5' end of the extended cDNA, 5'EST, or consensus contigated 5' EST are absent from
the hybridizing sequence, and wherein the overlapping 700 nucleotides are identical, the homology level
would be adjusted as follows. The non-matched, non-aligned 300 bases represent 30% of the length of
the extended cDNA, 5'EST, or consensus contigated 5' EST. If the overlapping 700 nucleotides are
100% identical, the adjusted homology level would be 100-30=70% homology. It should be noted that
the preceding adjustments are only made when the non-matched or non-aligned nucleotides are at the
5' or 3' ends. No adjustments are made if the non-matched or non-aligned sequences are internal or under
any other conditions.
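
The manual truncation adjustment reduces to one subtraction. The sketch below follows the worked example, in which the percentage is taken relative to the length of the extended cDNA, 5'EST, or consensus contigated 5'EST (300 of 1000 bases); the function and parameter names are hypothetical, and the caller is assumed to pass only bases that are unaligned at the 5' or 3' end.

```python
def adjust_homology(raw_percent, reference_length, unaligned_terminal):
    """Subtract the share of the reference sequence left unaligned at its ends.

    raw_percent        -- homology reported over the aligned region
    reference_length   -- length of the sequence the probe was derived from
    unaligned_terminal -- reference positions unaligned at the 5' or 3' end
    """
    penalty = 100.0 * unaligned_terminal / reference_length
    return raw_percent - penalty


# Worked example from the text: 1000 nt reference, first 300 nt absent.
adjusted = adjust_homology(100.0, 1000, 300)
```

With the values of the worked example this gives 100 - 30 = 70. Internal gaps are not passed to the function, consistent with the statement that no adjustment is made for internal non-matched sequences.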

For example, using the above methods, nucleic acids having at least 95% nucleic acid
homology, at least 96% nucleic acid homology, at least 97% nucleic acid homology, at least 98% nucleic
acid homology, at least 99% nucleic acid homology, or more than 99% nucleic acid homology to the
extended cDNA, 5'EST, or consensus contigated 5' EST from which the probe was derived may be
obtained and identified. Such nucleic acids may be allelic variants or related nucleic acids from other
species. Similarly, by using progressively less stringent hybridization conditions one can obtain and
identify nucleic acids having at least 90%, at least 85%, at least 80% or at least 75% homology to the
extended cDNA, 5'EST, or consensus contigated 5' EST from which the probe was derived.

Using the above methods and algorithms such as FASTA with parameters depending on the
sequence length and degree of homology studied, for example the default parameters used by the
algorithms in the absence of instructions from the user, one can obtain nucleic acids encoding proteins
having at least 99%, at least 98%, at least 97%, at least 96%, at least 95%, at least 90%, at least 85%, at
least 80% or at least 75% homology to the protein encoded by the extended cDNA, 5'EST, or consensus
contigated 5' EST from which the probe was derived. In some embodiments, the homology levels can
be determined using the "default" opening penalty and the "default" gap penalty, and a scoring matrix
such as PAM 250 (a standard scoring matrix; see Dayhoff et al., in: Atlas of Protein Sequence and
Structure, Vol. 5, Supp. 3 (1978)).

Alternatively, the level of polypeptide homology may be determined using the FASTDB
algorithm described by Brutlag et al., Comp. App. Biosci. 6:237-245, 1990. In such analyses the
parameters may be selected as follows: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=1, Joining
Penalty=20, Randomization Group Length=0, Cutoff Score=1, Window Size=Sequence Length, Gap
Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the homologous sequence,
whichever is shorter. If the homologous amino acid sequence is shorter than the amino acid sequence
encoded by the extended cDNA, 5'EST, or consensus contigated 5' EST as a result of an N terminal
and/or C terminal deletion the results may be manually corrected as follows. First, the number of amino
acid residues of the amino acid sequence encoded by the extended cDNA, 5'EST, or consensus
contigated 5' EST which are not matched or aligned with the homologous sequence is determined.
Then, the percentage of the length of the sequence encoded by the extended cDNA, 5'EST, or consensus
contigated 5' EST which the non-matched or non-aligned amino acids represent is calculated. This
percentage is subtracted from the homology level. For example, wherein the amino acid sequence
encoded by the extended cDNA, 5'EST, or consensus contigated 5' EST is 100 amino acids in length
and the length of the homologous sequence is 80 amino acids and wherein the amino acid sequence
encoded by the extended cDNA or 5'EST is truncated at the N terminal end with respect to the
homologous sequence, the homology level is calculated as follows. In the preceding scenario there are
20 non-matched, non-aligned amino acids in the sequence encoded by the extended cDNA, 5'EST, or
consensus contigated 5' EST. This represents 20% of the length of the amino acid sequence encoded by
the extended cDNA, 5'EST, or consensus contigated 5' EST. If the remaining amino acids are 100%
identical between the two sequences, the homology level would be 100%-20%=80% homology. No
adjustments are made if the non-matched or non-aligned sequences are internal or under any other
conditions.

In addition to the above described methods, other protocols are available to obtain extended
cDNAs using 5' ESTs or consensus contigated 5'ESTs as outlined in the following paragraphs.

Extended cDNAs may be prepared by obtaining mRNA from the tissue, cell, or organism of
interest using mRNA preparation procedures utilizing polyA selection procedures or other techniques
known to those skilled in the art. A first primer capable of hybridizing to the polyA tail of the mRNA is
hybridized to the mRNA and a reverse transcription reaction is performed to generate a first cDNA
strand.

The first cDNA strand is hybridized to a second primer containing at least 10 consecutive
nucleotides of the sequences of SEQ ID NOs 24-811 and 1600-1622. Preferably, the primer comprises
at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 consecutive nucleotides from the sequences of SEQ ID NOs
24-811 and 1600-1622. In some embodiments, the primer comprises more than 30 nucleotides from the
sequences of SEQ ID NOs 24-811 and 1600-1622. If it is desired to obtain extended cDNAs containing
the full protein coding sequence, including the authentic translation initiation site, the second primer
used contains sequences located upstream of the translation initiation site. The second primer is
extended to generate a second cDNA strand complementary to the first cDNA strand. Alternatively, RT-
PCR may be performed as described above using primers from both ends of the cDNA to be obtained.

Extended cDNAs containing 5' fragments of the mRNA may be prepared by contacting an 
mRNA comprising the sequences of SEQ ID NOs. 24-811 and 1600-1622 with a primer comprising a 
sequence complementary to a fragment of an EST-related nucleic acid, hybridizing the primer to the 
mRNAs, and reverse transcribing the hybridized primer to make a first cDNA strand from the mRNAs. 
Preferably, the primer comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 consecutive nucleotides 
of the sequences complementary to SEQ ID NOs. 24-811 and 1600-1622. 
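
As a minimal sketch of how such a complementary primer could be derived in silico, the following takes a window of consecutive nucleotides from a sense-strand sequence and returns its reverse complement. The function names and the example sequence are hypothetical and are not taken from the sequence listing; practical primer design would also consider melting temperature and secondary structure, which are omitted here.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_primer(sense_sequence: str, start: int, length: int = 20) -> str:
    """Return a primer complementary to `length` consecutive nucleotides.

    start  -- 0-based offset of the window in the sense-strand sequence
    length -- number of consecutive nucleotides (the text calls for at least 10)
    """
    if length < 10:
        raise ValueError("primer must span at least 10 consecutive nucleotides")
    window = sense_sequence[start:start + length]
    if len(window) < length:
        raise ValueError("window runs past the end of the sequence")
    return reverse_complement(window)


# Hypothetical sense-strand fragment, for illustration only.
example = "ATGGCCTTGACCGTAAGCTTGGATCC"
print(antisense_primer(example, start=0, length=10))  # TCAAGGCCAT
```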

Thereafter, a second cDNA strand complementary to the first cDNA strand is synthesized. The 
second cDNA strand may be made by hybridizing a primer complementary to sequences in the first 
cDNA strand to the first cDNA strand and extending the primer to generate the second cDNA strand. 

The double stranded extended cDNAs made using the methods described above are isolated 
and cloned. The extended cDNAs may be cloned into vectors such as plasmids or viral vectors capable 
of replicating in an appropriate host cell. For example, the host cell may be a bacterial, mammalian, 
avian, or insect cell. 

Techniques for isolating mRNA, reverse transcribing a primer hybridized to mRNA to generate 
a first cDNA strand, extending a primer to make a second cDNA strand complementary to the first 
cDNA strand, isolating the double stranded cDNA and cloning the double stranded cDNA are well 
known to those skilled in the art and are described in Current Protocols in Molecular Biology, John 
Wiley & Sons, Inc. 1997 and Sambrook et al., Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor Laboratory Press, 1989. 

Alternatively, other procedures may be used for obtaining full-length cDNAs or extended 
cDNAs. In one approach, full-length or extended cDNAs are prepared from mRNA and cloned into 
double stranded phagemids as follows. The cDNA library in the double stranded phagemids is then 
rendered single stranded by treatment with an endonuclease, such as the Gene II product of the phage F1, 
and an exonuclease (Chang et al., Gene 127:95-8, 1993). A biotinylated oligonucleotide comprising the 
sequence of a fragment of an EST-related nucleic acid is hybridized to the single stranded phagemids. 
Preferably, the fragment comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 consecutive nucleotides 
of the sequences of SEQ ID NOs. 24-811 and 1600-1622. 

Hybrids between the biotinylated oligonucleotide and phagemids are isolated by incubating the 
hybrids with streptavidin coated paramagnetic beads and retrieving the beads with a magnet (Fry et al., 
Biotechniques, 13: 124-131, 1992). Thereafter, the resulting phagemids are released from the beads and 
converted into double stranded DNA using a primer specific for the 5' EST or consensus contigated 
5'EST sequence used to design the biotinylated oligonucleotide. Alternatively, protocols such as the 
Gene Trapper kit (Gibco BRL) may be used. The resulting double stranded DNA is transformed into 
bacteria. Extended cDNAs or full length cDNAs containing the 5' EST or consensus contigated 5'EST 
sequence are identified by colony PCR or colony hybridization. 

Using any of the above described methods in section III, a plurality of extended cDNAs 
containing full-length protein coding sequences or portions of the protein coding sequences may be 
provided as cDNA libraries for subsequent evaluation of the encoded proteins or use in diagnostic assays 
as described below. 

EXAMPLE 21 
Full Length cDNAs 

The procedures described in Examples 19 and 20 were used to obtain extended cDNAs or full 
length cDNAs derived from 5' ESTs in a variety of tissues. The following list provides a few examples 
of cDNAs obtained by these means. 

Using this procedure, the full length cDNA of SEQ ID NO:1 (internal identification number 
58-34-2-E7-FL2) was obtained. This cDNA encodes the signal peptide MWWFQQGLSFLPSALVIWTSA 
(SEQ ID NO:2) having a von Heijne score of 5.5. 

Using this approach, the full length cDNA of SEQ ID NO:3 (internal identification number 
48-19-3-G1-FL1) was obtained. This cDNA encodes the signal peptide MKKVLLLITAILAVAVG (SEQ 
ID NO: 4) having a von Heijne score of 82. 

The full length cDNA of SEQ ID NO:5 (internal identification number 58-35-2-F10-FL2) was 
also obtained using this procedure. This cDNA encodes a signal peptide LWLLFFLVTAIHA (SEQ ID 
NO:6) having a von Heijne score of 10.7. 

Furthermore, the polypeptides encoded by the extended or full-length cDNAs may be screened 
for the presence of known structural or functional motifs or for the presence of signatures, small amino 
acid sequences which are well conserved amongst the members of a protein family. The results obtained 
for the polypeptides encoded by a few full-length cDNAs derived from 5'ESTs that were screened for 
the presence of known protein signatures and motifs using the ProScan software from the GCG 
package and the Prosite 15.0 database are provided below. 
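
To illustrate what screening a polypeptide for a signature involves, the following sketch converts a pattern written in PROSITE-style syntax into a regular expression and reports 1-based match positions. It is not the ProScan implementation; the function names are hypothetical, only a subset of the pattern syntax is handled, and the leucine zipper pattern used in the example (L-x(6)-L-x(6)-L-x(6)-L) is written from general knowledge of that signature rather than taken from this document. The test sequence is invented.

```python
import re

# One pattern element: a residue specification followed by an optional repeat.
_ELEMENT = re.compile(
    r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})"   # x, a residue, [allowed] or {forbidden}
    r"(?:\((\d+)(?:,(\d+))?\))?"         # optional (n) or (n,m)
)


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-style pattern into a Python regular expression."""
    pieces = []
    for element in pattern.strip().rstrip(".").split("-"):
        start_anchor = element.startswith("<")   # '<' anchors at the N terminus
        end_anchor = element.endswith(">")       # '>' anchors at the C terminus
        match = _ELEMENT.fullmatch(element.strip("<>"))
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        residue, low, high = match.groups()
        if residue == "x":
            piece = "."                           # any residue
        elif residue.startswith("{"):
            piece = "[^" + residue[1:-1] + "]"    # any residue except those listed
        else:
            piece = residue                       # single residue or [set]
        if low is not None:
            piece += "{" + low + ("," + high if high is not None else "") + "}"
        pieces.append(("^" if start_anchor else "") + piece + ("$" if end_anchor else ""))
    return "".join(pieces)


def scan_motif(sequence: str, pattern: str) -> list:
    """Return (start, end) positions, 1-based and inclusive, of all matches."""
    # A lookahead is used so that overlapping matches are also reported.
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start() + 1, m.start() + len(m.group(1)))
            for m in regex.finditer(sequence)]


# Invented test sequence with leucines at positions 3, 10, 17 and 24.
print(scan_motif("MKLEEEEEELQQQQQQLRRRRRRLGG", "L-x(6)-L-x(6)-L-x(6)-L"))  # [(3, 24)]
```

A match of this pattern always spans 22 residues, which is the same span as a signature reported from positions 259 to 280.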

The protein of SEQ ID NO: 8 encoded by the full-length cDNA SEQ ID NO: 7 (internal 
designation 78-8-3-E6-CL0_lC) and expressed in adult prostate belongs to the 
phosphatidylethanolamine-binding protein family, of which it exhibits the characteristic PROSITE 
signature from positions 90 to 112. Proteins from this widespread family, found from nematodes to fly, 
yeast, rodent and primate species, bind hydrophobic ligands such as phospholipids and nucleotides. 
They are mostly expressed in brain and in testis and are thought to play a role in cell growth and/or 
maturation, in regulation of sperm maturation and motility, and in membrane remodeling. They may 
act either through signal transduction or through oxidoreduction reactions (for a review see 
Schoentgen and Jolles, FEBS Letters, 369:22-26 (1995)). Taken together, these data suggest that the 
protein of SEQ ID NO: 8 may play a role in cell growth, maturation and in membrane remodeling 
and/or may be related to male fertility. Thus, this protein may be useful in diagnosing and/or 
treating cancer, neurodegenerative diseases, and/or disorders related to male fertility and sterility. 

The protein of SEQ ID NO: 10 encoded by the full-length cDNA SEQ ID NO: 9 (internal 
designation 108-013-5-0-H9-FLC) shows homologies with a family of lysophospholipases 
conserved among eukaryotes (yeast, rabbit, rodents and human). In addition, some members of this 
family exhibit a calcium-independent phospholipase A2 activity (Portilla et al., J. Am. Soc. Nephrol., 
9:1178-1186 (1998)). All members of this family exhibit the active site consensus GXSXG motif of 
carboxylesterases that is also found in the protein of SEQ ID NO: 10 (positions 54 to 58). In addition, 
this protein may be a membrane protein with one transmembrane domain as predicted by the 
software TopPred II (Claros and von Heijne, CABIOS applic. Notes, 10:685-686 (1994)). Taken 
together, these data suggest that the protein of SEQ ID NO: 10 may play a role in fatty acid 
metabolism, probably as a phospholipase. Thus, this protein or part thereof may be useful in 
diagnosing and/or treating several disorders including, but not limited to, cancer, diabetes, and 
neurodegenerative disorders such as Parkinson's and Alzheimer's diseases. It may also be useful in 
modulating inflammatory responses to infectious agents and/or to suppress graft rejection. 
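
A consensus such as GXSXG, where X is any residue, can be located in a protein sequence with a simple search. The sketch below is illustrative only; the function name is hypothetical and the example sequence is invented, not the protein of SEQ ID NO: 10.

```python
import re


def find_gxsxg(protein: str) -> list:
    """Return 1-based (start, end) positions of every G-X-S-X-G occurrence."""
    # The lookahead reports overlapping occurrences as well.
    return [(m.start() + 1, m.start() + 5)
            for m in re.finditer(r"(?=G.S.G)", protein)]


# Invented sequence with the motif at positions 4 to 8.
print(find_gxsxg("MAAGHSLGKK"))  # [(4, 8)]
```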

The protein of SEQ ID NO: 12 encoded by the full-length cDNA SEQ ID NO: 11 (internal 
designation 108-004-5-0-D10-FLC) shows remote homology to a subfamily of 
beta4-galactosyltransferases widely conserved in animals (human, rodents, cow and chicken). Such 
enzymes, usually type II membrane proteins located in the endoplasmic reticulum or in the Golgi 
apparatus, catalyze the biosynthesis of glycoproteins, glycolipid glycans and lactose. Their 
characteristic features, defined as those of subfamily A in Breton et al., J. Biochem., 123:1000-1009 
(1998), are well conserved in the protein of SEQ ID NO: 12, especially the region I containing 
the DVD motif (positions 163-165) thought to be involved either in UDP binding or in the catalytic 
process itself. In addition, the protein of SEQ ID NO: 12 has the typical structure of a type II protein. 
Indeed, it contains a short 28-amino-acid-long N-terminal tail, a transmembrane segment from 
positions 29 to 49 and a large 278-amino-acid-long C-terminal tail as predicted by the software 
TopPred II (Claros and von Heijne, CABIOS applic. Notes, 10:685-686 (1994)). Taken together, 
these data suggest that the protein of SEQ ID NO: 12 may play a role in the biosynthesis of 
polysaccharides, and of the carbohydrate moieties of glycoproteins and glycolipids and/or in cell-cell 
recognition. Thus, this protein may be useful in diagnosing and/or treating several types of disorders 
including, but not limited to, cancer, atherosclerosis, cardiovascular disorders, autoimmune disorders 
and rheumatic diseases including rheumatoid arthritis. 
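
The segment lengths quoted above follow from the transmembrane coordinates by simple arithmetic, sketched below. The function name is hypothetical, and the total length of 327 residues is an assumption derived by adding the three segments given in the text (28 + 21 + 278); it is not stated in the source.

```python
def membrane_layout(protein_length: int, tm_start: int, tm_end: int) -> dict:
    """Split a single-pass membrane protein into its three segments.

    Positions are 1-based and inclusive, as in the text.
    """
    if not 1 <= tm_start <= tm_end <= protein_length:
        raise ValueError("transmembrane segment must lie within the protein")
    return {
        "n_terminal_tail": tm_start - 1,          # residues before the segment
        "transmembrane": tm_end - tm_start + 1,   # residues in the segment
        "c_terminal_tail": protein_length - tm_end,
    }


# Transmembrane segment at positions 29 to 49; length 327 is a derived assumption.
print(membrane_layout(327, 29, 49))
# {'n_terminal_tail': 28, 'transmembrane': 21, 'c_terminal_tail': 278}
```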

The protein of SEQ ID NO: 14 encoded by the full-length cDNA SEQ ID NO: 13 (internal 
designation 108-009-5-0-A2-FLC) shows extensive homology to the bZIP family of transcription 
factors, and especially to the human luman protein (Lu et al., Mol. Cell. Biol., 17:5117-5126 
(1997)). The match includes the whole bZIP domain composed of a basic DNA-binding domain and 
of a leucine zipper allowing protein dimerization. The basic domain is conserved in the protein of 
SEQ ID NO: 14 as shown by the characteristic PROSITE signature (positions 224-237) except for a 
conservative substitution of a glutamic acid with an aspartic acid in position 233. The typical 
PROSITE signature for leucine zipper is also present (positions 259 to 280). Taken together, these 
data suggest that the protein of SEQ ID NO: 14 may bind to DNA, hence regulating gene expression 
as a transcription factor. Thus, this protein may be useful in diagnosing and/or treating several types 
of disorders including, but not limited to, cancer. 

Bacterial clones containing plasmids containing the full length cDNAs described above are 
presently stored in the inventor's laboratories under the internal identification numbers provided above. 
The inserts may be recovered from the deposited materials by growing an aliquot of the appropriate 
bacterial clone in the appropriate medium. The plasmid DNA can then be isolated using plasmid 
isolation procedures familiar to those skilled in the art such as alkaline lysis minipreps or large scale 
alkaline lysis plasmid isolation procedures. If desired the plasmid DNA may be further enriched by 
centrifugation on a cesium chloride gradient, size exclusion chromatography, or anion exchange 
chromatography. The plasmid DNA obtained using these procedures may then be manipulated using 
standard cloning techniques familiar to those skilled in the art. Alternatively, a PCR can be done with 
primers designed at both ends of the insertion. The PCR product which corresponds to the cDNA insert 
can then be manipulated using standard cloning techniques familiar to those skilled in the art. 

V. Expression of Proteins or Polypeptides Encoded by EST-related nucleic acids or Fragments 
thereof 

EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of 
EST-related nucleic acids, and fragments of positional segments of EST-related nucleic acids may be used to 
express the polypeptides which they encode. In particular, they may be used to express EST-related 
polypeptides, fragments of EST-related polypeptides, positional segments of EST-related polypeptides, 
or fragments of positional segments of EST-related polypeptides. In some embodiments, the 
EST-related nucleic acids, positional segments of EST-related nucleic acids, and fragments of positional 
segments of EST-related nucleic acids may be used to express the full polypeptide (i.e. the signal peptide 
and the mature polypeptide) of a secreted protein, the mature protein (i.e. the polypeptide generated after 
cleavage of the signal peptide), or the signal peptide of a secreted protein. If desired, nucleic acids 
encoding the signal peptide may be used to facilitate secretion of the expressed protein. It will be 
appreciated that a plurality of EST-related nucleic acids, fragments of EST-related nucleic acids, 
positional segments of EST-related nucleic acids, or fragments of positional segments of EST-related 
nucleic acids may be simultaneously cloned into expression vectors to create an expression library for 
analysis of the encoded proteins as described below. 



EXAMPLE 22 
Expression of the Proteins Encoded by the Genes Corresponding to the 
5'ESTs or Consensus Contigated 5' ESTs 

To express their encoded proteins, the EST-related nucleic acids, fragments of EST-related 
nucleic acids, positional segments of EST-related nucleic acids, or fragments of positional segments of 
EST-related nucleic acids are cloned into a suitable expression vector. In some instances, nucleic acids 
encoding EST-related polypeptides, fragments of EST-related polypeptides, positional segments of 
EST-related polypeptides or fragments of positional segments of EST-related polypeptides may be cloned into 
a suitable expression vector. 

In some embodiments, the nucleic acids inserted into the expression vector may comprise the 
coding sequence of a sequence selected from the group consisting of SEQ ID NOs. 24-811. In other 
embodiments, the nucleic acids inserted into the expression vector may comprise the full 
coding sequence (i.e. the nucleotides encoding the signal peptide and the mature polypeptide) of one of 
SEQ ID Nos. 766-792. In some embodiments, the nucleic acid inserted into the expression vector may 
comprise the nucleotides of one of the sequences of SEQ ID Nos. 766-792 which encode the mature 
polypeptide (i.e. the nucleotides encoding the polypeptide generated after cleavage of the signal 
peptide). In further embodiments, the nucleic acids inserted into the expression vector may comprise the 
nucleotides of SEQ ID Nos. 24-728 and 766-792 which encode the signal peptide to facilitate secretion of the 
expressed protein. The nucleic acids inserted into the expression vectors may also contain sequences 
upstream of the sequences encoding the signal peptide, such as sequences which regulate expression 
levels or sequences which confer tissue specific expression. 

The nucleic acid inserted into the expression vector may encode a polypeptide comprising 
one of the sequences of SEQ ID Nos. 812-1599. In some embodiments, the nucleic acid inserted into 
the expression vector may encode the full polypeptide sequence (i.e. the signal peptide and the mature 
polypeptide) included in one of SEQ ID Nos. 1554-1580. In other embodiments, the nucleic acid 
inserted into the expression vector may encode the mature polypeptide (i.e. the polypeptide generated 
after cleavage of the signal peptide) included in one of the sequences of SEQ ID Nos. 1554-1580. In 
further embodiments, the nucleic acids inserted into the expression vector may encode the signal peptide 
included in one of the sequences of SEQ ID Nos. 812-1516 and 1554-1580. 

The nucleic acid encoding the protein or polypeptide to be expressed is operably linked to a 
promoter in an expression vector using conventional cloning technology. The expression vector may be 
any of the mammalian, yeast, insect or bacterial expression systems known in the art. Vectors and 
expression systems are commercially available from a variety of suppliers including Genetics 
Institute (Cambridge, MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and 
Invitrogen (San Diego, California). If desired, to enhance expression and facilitate proper protein 
folding, the codon context and codon pairing of the sequence may be optimized for the particular 
expression organism in which the expression vector is introduced, as explained by Hatfield, et al., U.S. 
Patent No. 5,082,767. 

The following is provided as one exemplary method to express the proteins encoded by the 
nucleic acids described above. In some instances the nucleic acid encoding the protein or polypeptide to 
be expressed includes a methionine initiation codon and a polyA signal. If the nucleic acid encoding the 
polypeptide to be expressed lacks a methionine to serve as the initiation site, an initiating methionine can 
be introduced next to the first codon of the nucleic acid using conventional techniques. Similarly, if the 
nucleic acid encoding the protein or polypeptide to be expressed lacks a polyA signal, this sequence can 
be added to the construct by, for example, splicing out the polyA signal from pSG5 (Stratagene) using 
BglI and SalI restriction endonuclease enzymes and incorporating it into the mammalian expression 
vector pXT1 (Stratagene). pXT1 contains the LTRs and a portion of the gag gene from Moloney 
Murine Leukemia Virus. The position of the LTRs in the construct allows efficient stable transfection. 
The vector includes the Herpes Simplex thymidine kinase promoter and the selectable neomycin gene. 
The nucleic acid encoding the polypeptide to be expressed is obtained by PCR from the bacterial vector 
using oligonucleotide primers complementary to the nucleic acid encoding the protein or polypeptide to 
be expressed and containing restriction endonuclease sequences for PstI incorporated into the 5' primer 
and BglII at the 5' end of the 3' primer, taking care to ensure that the nucleic acid encoding the protein or 
polypeptide to be expressed is correctly positioned with respect to the polyA signal. The purified 
fragment obtained from the resulting PCR reaction is digested with PstI, blunt ended with an 
exonuclease, digested with BglII, purified and ligated to pXT1, now containing a polyA signal and 
digested with BglII. 
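
The primer layout described above, with a restriction site placed at the 5' end of each primer, can be sketched as follows. This is a simplified illustration under stated assumptions: the function names, the annealing length, and the example coding sequence are hypothetical; the recognition sequences used are the commonly cited ones for PstI (CTGCAG) and BglII (AGATCT); and extra 5' flanking bases, which are often added so that the enzymes cut efficiently near the end of a PCR product, are omitted.

```python
PST_I_SITE = "CTGCAG"    # PstI recognition sequence
BGL_II_SITE = "AGATCT"   # BglII recognition sequence

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_cloning_primers(coding_sequence: str, anneal_length: int = 18):
    """Return (forward, reverse) primers carrying PstI and BglII sites.

    The forward primer is the PstI site followed by the start of the coding
    sequence; the reverse primer is the BglII site followed by the reverse
    complement of the end of the coding sequence.
    """
    cds = coding_sequence.upper()
    if len(cds) < anneal_length:
        raise ValueError("coding sequence is shorter than the annealing length")
    # An internal site would be cut during digestion and truncate the insert.
    for name, site in (("PstI", PST_I_SITE), ("BglII", BGL_II_SITE)):
        if site in cds:
            raise ValueError(f"coding sequence contains an internal {name} site")
    forward = PST_I_SITE + cds[:anneal_length]
    reverse = BGL_II_SITE + reverse_complement(cds[-anneal_length:])
    return forward, reverse


# Invented 21-nucleotide coding sequence, short annealing length for readability.
print(design_cloning_primers("ATGGCCAAATTTGGGCCCTGA", anneal_length=6))
# ('CTGCAGATGGCC', 'AGATCTTCAGGG')
```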

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life 
Technologies, Inc., Grand Island, New York) under conditions outlined in the product specification. 
Positive transfectants are selected after growing the transfected cells in 600 µg/ml G418 (Sigma, St. 
Louis, Missouri). 

Alternatively, the nucleic acid encoding the protein or polypeptide to be expressed may be 
cloned into pED6dpc2. The resulting pED6dpc2 constructs may be transfected into a suitable host cell, 
such as COS 1 cells. Methotrexate resistant cells are selected and expanded. The expressed protein or 
polypeptide may be isolated, purified, or enriched as described above. 

To confirm expression of the desired protein or polypeptide, the proteins or polypeptides 
produced by cells containing a vector with a nucleic acid insert encoding the protein or polypeptide are 
compared to those lacking such an insert. The expressed proteins are detected using techniques familiar 
to those skilled in the art such as Coomassie blue or silver staining or using antibodies against the protein 
or polypeptide encoded by the nucleic acid insert. Antibodies capable of specifically recognizing the 
protein of interest may be generated using synthetic 15-mer peptides having a sequence encoded by the 
appropriate nucleic acid. The synthetic peptides are injected into mice to generate antibody to the 
polypeptide encoded by the nucleic acid. 

If the proteins or polypeptides encoded by the nucleic acid inserts are secreted, medium 
prepared from the host cells or organisms containing an expression vector which contains a nucleic acid 
insert encoding the desired protein or polypeptide is compared to medium prepared from the control cells 
or organism. The presence of a band in medium from the cells containing the nucleic acid insert which 
is absent from preparations from the control cells indicates that the protein or polypeptide encoded by the 
nucleic acid insert is being expressed and secreted. Generally, the band corresponding to the protein 
encoded by the nucleic acid insert will have a mobility near that expected based on the number of amino 
acids in the open reading frame of the nucleic acid insert. However, the band may have a mobility 
different than that expected as a result of modifications such as glycosylation, ubiquitination, or 
enzymatic cleavage. 
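
A rough expectation for band position can be obtained from the length of the open reading frame. The sketch below uses the common rule of thumb of about 110 Da per amino acid residue, which is an approximation introduced here and not a figure from this document; the function name and example length are hypothetical. As noted above, modifications such as glycosylation or cleavage will shift the observed mobility.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average, an assumption


def expected_mass_kda(orf_length_nt: int, includes_stop_codon: bool = True) -> float:
    """Estimate the unmodified polypeptide mass (kDa) from ORF length."""
    if orf_length_nt <= 0 or orf_length_nt % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    residues = orf_length_nt // 3
    if includes_stop_codon:
        residues -= 1  # the stop codon does not encode a residue
    return residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


# A hypothetical 903-nucleotide ORF (300 codons plus a stop codon).
print(expected_mass_kda(903))  # 33.0
```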

Alternatively, if the protein expressed from the above expression vectors does not contain 
sequences directing its secretion, the proteins expressed from host cells containing an expression vector 
with an insert encoding a secreted protein or portion thereof can be compared to the proteins expressed 
in control host cells containing the expression vector without an insert. The presence of a band in 
samples from cells containing the expression vector with an insert which is absent in samples from cells 
containing the expression vector without an insert indicates that the desired protein or portion thereof is 
being expressed. Generally, the band will have the mobility expected for the secreted protein or portion 
thereof. However, the band may have a mobility different than that expected as a result of modifications 
such as glycosylation, ubiquitination, or enzymatic cleavage. 

The expressed protein or polypeptide may be purified, isolated or enriched using a variety of 
methods. In some methods, the protein or polypeptide may be secreted into the culture medium via a 
native signal peptide or a heterologous signal peptide operably linked thereto. In some methods, the 
protein or polypeptide may be linked to a heterologous polypeptide which facilitates its isolation, 
purification, or enrichment such as a nickel binding polypeptide. The protein or polypeptide may also be 
obtained by gel electrophoresis, ion exchange chromatography, size exclusion chromatography, HPLC, salt 
precipitation, immunoprecipitation, a combination of any of the preceding methods, or any of the 
isolation, purification, or enrichment techniques familiar to those skilled in the art. 

The protein encoded by the nucleic acid insert may also be purified using standard 
immunoaffinity chromatography techniques with antibodies directed 
against the encoded protein or polypeptide as described in more detail below. If antibody production is 
not possible, the nucleic acid insert encoding the desired protein or polypeptide may be incorporated into 
expression vectors designed for use in purification schemes employing chimeric polypeptides. In such 
strategies, the coding sequence of the nucleic acid insert is ligated in frame with the gene encoding the 
other half of the chimera. The other half of the chimera may be β-globin or a nickel binding 
polypeptide. A chromatography matrix having antibody to β-globin or nickel attached thereto is then 
used to purify the chimeric protein. Protease cleavage sites may be engineered between the β-globin 
gene or the nickel binding polypeptide and the extended cDNA or portion thereof. Thus, the two 
polypeptides of the chimera may be separated from one another by protease digestion. 

One useful expression vector for generating β-globin chimerics is pSG5 (Stratagene), which 
encodes rabbit β-globin. Intron II of the rabbit β-globin gene facilitates splicing of the expressed 
transcript, and the polyadenylation signal incorporated into the construct increases the level of 
expression. These techniques as described are well known to those skilled in the art of molecular 
biology. Standard methods are published in methods texts such as Davis et al. (Basic Methods in 
Molecular Biology, L.G. Davis, M.D. Dibner, and J.F. Battey, ed., Elsevier Press, NY, 1986) and many 
of the methods are available from Stratagene, Life Technologies, Inc., or Promega. Polypeptide may 
additionally be produced from the construct using in vitro translation systems such as the In vitro 
Express™ Translation Kit (Stratagene). 

Following expression and purification of the proteins or polypeptides encoded by the nucleic 
acid inserts, the purified proteins may be tested for the ability to bind to the surface of various cell types 
as described in Example 23 below. It will be appreciated that a plurality of proteins expressed from 
these nucleic acid inserts may be included in a panel of proteins to be simultaneously evaluated for the 
activities specifically described below, as well as other biological roles for which assays for determining 
activity are available. 

EXAMPLE 23 

Analysis of Secreted Proteins to Determine Whether they Bind to the Cell Surface 

The EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of 
EST-related nucleic acids, fragments of positional segments of EST-related nucleic acids, nucleic acids 
encoding the EST-related polypeptides, nucleic acids encoding fragments of the EST-related 
polypeptides, nucleic acids encoding positional segments of EST-related polypeptides, or nucleic acids 
encoding fragments of positional segments of EST-related polypeptides are cloned into expression 
vectors such as those described in Example 22. The encoded proteins or polypeptides are purified, 
isolated, or enriched as described above. Following purification, isolation, or enrichment, the proteins or 
polypeptides are labeled using techniques known to those skilled in the art. The labeled proteins or 
polypeptides are incubated with cells or cell lines derived from a variety of organs or tissues to allow the 
proteins to bind to any receptor present on the cell surface. Following the incubation, the cells are 
washed to remove non-specifically bound proteins or polypeptides. The specifically bound labeled 
proteins or polypeptides are detected by autoradiography. Alternatively, unlabeled proteins or 
polypeptides may be incubated with the cells and detected with antibodies having a detectable label, such 
as a fluorescent molecule, attached thereto. 

Specificity of cell surface binding may be analyzed by conducting a competition analysis in 
which various amounts of unlabeled protein or polypeptide are incubated along with the labeled protein 
or polypeptide. The amount of labeled protein or polypeptide bound to the cell surface decreases as the 
amount of competitive unlabeled protein or polypeptide increases. As a control, various amounts of an 
unlabeled protein or polypeptide unrelated to the labeled protein or polypeptide are included in some 
binding reactions. The amount of labeled protein or polypeptide bound to the cell surface does not 
decrease in binding reactions containing increasing amounts of unrelated unlabeled protein, indicating 
that the protein or polypeptide encoded by the nucleic acid binds specifically to the cell surface. 
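
The decision logic of the competition analysis can be sketched as a simple check on two dilution series. This is an illustrative sketch only: the function name, the 10% tolerance, and the signal values are hypothetical, and a real analysis would fit binding curves and account for replicate variability.

```python
def binding_is_specific(bound_with_homologous: list,
                        bound_with_unrelated: list,
                        tolerance: float = 0.10) -> bool:
    """Interpret a competition experiment.

    Each list holds the labeled signal bound to cells, ordered by increasing
    amount of unlabeled competitor (the first entry is the lowest amount).
    Binding is called specific when the signal falls steadily with the
    homologous competitor but stays within `tolerance` of its starting value
    with the unrelated competitor.
    """
    if len(bound_with_homologous) < 2 or len(bound_with_unrelated) < 2:
        raise ValueError("each series needs at least two measurements")
    displaced = all(later < earlier for earlier, later
                    in zip(bound_with_homologous, bound_with_homologous[1:]))
    baseline = bound_with_unrelated[0]
    if baseline <= 0:
        raise ValueError("baseline signal must be positive")
    unchanged = all(abs(value - baseline) <= tolerance * baseline
                    for value in bound_with_unrelated)
    return displaced and unchanged


# Hypothetical counts for illustration.
print(binding_is_specific([1000, 620, 310, 90], [1000, 980, 1010, 995]))  # True
```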

As discussed above, human proteins have been shown to have a number of important 
physiological effects and, consequently, represent a valuable therapeutic resource. The human proteins 
or polypeptides made as described above may be evaluated to determine their physiological activities as 
described below. 

EXAMPLE 24 
Assaying the Expressed Proteins or Polypeptides for Cytokine, 
Cell Proliferation or Cell Differentiation Activity 

As discussed above, some human proteins act as cytokines or may affect cellular proliferation or 
differentiation. Many protein factors discovered to date, including all known cytokines, have exhibited 
activity in one or more factor dependent cell proliferation assays, and hence the assays serve as a 
convenient confirmation of cytokine activity. The activity of a protein or polypeptide of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation assays for 
cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M+ (preB 
M+), 2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK. The proteins or polypeptides 
prepared as described above may be evaluated for their ability to regulate T cell or thymocyte 
proliferation in assays such as those described above or in the following references: Current Protocols 
in Immunology, Ed. by J.E. Coligan et al., Greene Publishing Associates and Wiley-Interscience; Takai 
et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; 
Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli et al., J. Immunol. 
149:3778-3783, 1992; Bowman et al., J. Immunol. 152:1756-1761, 1994. 

In addition, numerous assays for cytokine production and/or the proliferation of spleen cells, 
lymph node cells and thymocytes are known. These include the techniques disclosed in Current 
Protocols in Immunology, J.E. Coligan et al. Eds., 1:3.12.1-3.12.14, John Wiley and Sons, Toronto, 
1994; and Schreiber, R.D. in Current Protocols in Immunology, supra 1:6.8.1-6.8.8. 

The proteins or polypeptides prepared as described above may also be assayed for the ability to 
regulate the proliferation and differentiation of hematopoietic or lymphopoietic cells. Many assays for 
such activity are familiar to those skilled in the art, including the assays in the following references: 
Bottomly et al. in Current Protocols in Immunology, supra 1:6.3.1-6.3.12; deVries et al., J. Exp. 
Med. 173:1205-1211, 1991; Moreau et al., Nature 36:690-692, 1988; Greenberger et al., Proc. Natl. 
Acad. Sci. U.S.A. 80:2931-2938, 1983; Nordan, R. in Current Protocols in Immunology, supra 
1:6.6.1-6.6.5; Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Bennett et al. in Current 
Protocols in Immunology, supra 1:6.15.1; Ciarletta et al. in Current Protocols in Immunology, supra 
1:6.13.1. 

The proteins or polypeptides prepared as described above may also be assayed for their ability to 
regulate T-cell responses to antigens. Many assays for such activity are familiar to those skilled in the 
art, including the assays described in the following references: Chapter 3 (In vitro Assays for Mouse 
Lymphocyte Function), Chapter 6 (Cytokines and Their Cellular Receptors) and Chapter 7 
(Immunologic Studies in Humans) in Current Protocols in Immunology, supra; Weinberger et al., Proc. 
Natl. Acad. Sci. 77:6091-6095, 1980; Weinberger et al., Eur. J. Immunol. 11:405-411, 1981; Takai et 
al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

Those proteins or polypeptides which exhibit cytokine, cell proliferation, or cell differentiation 
activity may then be formulated as pharmaceuticals and used to treat clinical conditions in which 
induction of cell proliferation or differentiation is beneficial. Alternatively, as described in more detail 
below, nucleic acids encoding these proteins or polypeptides or nucleic acids regulating the expression 
of these proteins or polypeptides may be introduced into appropriate host cells to increase or decrease the 
expression of the proteins or polypeptides as desired. 

EXAMPLE 25 

Assaying the Expressed Proteins or Polypeptides 
for Activity as Immune System Regulators 

The proteins or polypeptides prepared as described above may also be evaluated for their effects
as immune regulators. For example, the proteins or polypeptides may be evaluated for their activity to
influence thymocyte or splenocyte cytotoxicity. Numerous assays for such activity are familiar to those
skilled in the art, including the assays described in the following references: Chapter 3 (In vitro Assays
for Mouse Lymphocyte Function 3.1-3.19) and Chapter 7 (Immunologic Studies in Humans) in Current
Protocols in Immunology, J.E. Coligan et al. Eds., Greene Publishing Associates and Wiley-Interscience;
Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. Immunol.
128:1968-1974, 1982; Handa et al., J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol.
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-
1998; Bertagnolli et al., Cell. Immunol. 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092,
1994.

The proteins or polypeptides prepared as described above may also be evaluated for their effects
on T-cell dependent immunoglobulin responses and isotype switching. Numerous assays for such
activity are familiar to those skilled in the art, including the assays disclosed in the following references:
Maliszewski, J. Immunol. 144:3028-3033, 1990; Mond et al., in Current Protocols in Immunology, 1:
3.8.1-3.8.16.

The proteins or polypeptides prepared as described above may also be evaluated for their effect
on immune effector cells, including their effect on Th1 cells and cytotoxic lymphocytes. Numerous
assays for such activity are familiar to those skilled in the art, including the assays disclosed in the
following references: Chapter 3 (In vitro Assays for Mouse Lymphocyte Function 3.1-3.19) and Chapter
7 (Immunologic Studies in Humans) in Current Protocols in Immunology, supra; Takai et al., J.
Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J.
Immunol. 149:3778-3783, 1992.

The proteins or polypeptides prepared as described above may also be evaluated for their effect
on dendritic cell mediated activation of naive T-cells. Numerous assays for such activity are familiar to
those skilled in the art, including the assays disclosed in the following references: Guery et al., J.
Immunol. 134:536-544, 1995; Inaba et al., J. Exp. Med. 173:549-559, 1991; Macatonia et al., J.
Immunol. 154:5071-5079, 1995; Porgador et al., J. Exp. Med. 182:255-260, 1995; Nair et al., J. Virol.
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., J. Exp. Med. 169:1255-
1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., J. Exp.
Med. 172:631-640, 1990.

The proteins or polypeptides prepared as described above may also be evaluated for their
influence on the lifetime of lymphocytes. Numerous assays for such activity are familiar to those skilled
in the art, including the assays disclosed in the following references: Darzynkiewicz et al., Cytometry
13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Res. 53:1945-
1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, J. Immunol. 145:4037-4045, 1990; Zamai et
al., Cytometry 14:891-897, 1993; Gorczyca et al., Int. J. Oncol. 1:639-648, 1992.

The proteins or polypeptides prepared as described above may also be evaluated for their
influence on early steps of T-cell commitment and development. Numerous assays for such activity are
familiar to those skilled in the art, including without limitation the assays disclosed in the following
references: Antica et al., Blood 84:111-117, 1994; Fine et al., Cell. Immunol. 155:111-122, 1994; Galy
et al., Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad. Sci. USA 88:7548-7551, 1991.

Those proteins or polypeptides which exhibit activity as immune system regulators may
then be formulated as pharmaceuticals and used to treat clinical conditions in which regulation of
immune activity is beneficial. For example, the protein or polypeptide may be useful in the treatment of
various immune deficiencies and disorders (including severe combined immunodeficiency), e.g., in
regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as effecting the
cytolytic activity of NK cells and other cell populations. These immune deficiencies may be genetic or
be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from autoimmune
disorders. More specifically, infectious diseases caused by viral, bacterial, fungal or other infection may
be treatable using the protein or polypeptide, including infections by HIV, hepatitis viruses,
herpesviruses, mycobacteria, Leishmania spp., Plasmodium, and various fungal infections such as
candidiasis. Of course, in this regard, a protein or polypeptide may also be useful where a boost to the
immune system generally may be desirable, i.e., in the treatment of cancer.

Alternatively, the proteins or polypeptides prepared as described above may be used in
treatment of autoimmune disorders including, for example, connective tissue disease, multiple sclerosis,
systemic lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre
syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitus, myasthenia gravis, graft-
versus-host disease and autoimmune inflammatory eye disease. Such a protein or polypeptide may also
be useful in the treatment of allergic reactions and conditions, such as asthma (particularly allergic
asthma) or other respiratory problems. Other conditions in which immune suppression is desired
(including, for example, organ transplantation) may also be treatable using the protein or polypeptide.

Using the proteins or polypeptides of the invention it may also be possible to regulate immune
responses either up or down. Down regulation may involve inhibiting or blocking an immune response
already in progress or may involve preventing the induction of an immune response. The functions of
activated T-cells may be inhibited by suppressing T cell responses or by inducing specific tolerance in T
cells, or both. Immunosuppression of T cell responses is generally an active non-antigen-specific
process which requires continuous exposure of the T cells to the suppressive agent. Tolerance, which
involves inducing non-responsiveness or anergy in T cells, is distinguishable from immunosuppression
in that it is generally antigen-specific and persists after the end of exposure to the tolerizing agent.
Operationally, tolerance can be demonstrated by the lack of a T cell response upon reexposure to specific
antigen in the absence of the tolerizing agent.

Down regulating or preventing one or more antigen functions (including without limitation B
lymphocyte antigen functions, such as, for example, B7 costimulation), e.g., preventing high level
lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and organ
transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell function
should result in reduced tissue destruction in tissue transplantation. Typically, in tissue transplants,
rejection of the transplant is initiated through its recognition as foreign by T cells, followed by an
immune reaction that destroys the transplant. The administration of a molecule which inhibits or blocks
interaction of a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble,
monomeric form of a peptide having B7-2 activity alone or in conjunction with a monomeric form of a
peptide having an activity of another B lymphocyte antigen (e.g., B7-1, B7-3) or blocking antibody),
prior to transplantation, can lead to the binding of the molecule to the natural ligand(s) on the immune
cells without transmitting the corresponding costimulatory signal. Blocking B lymphocyte antigen
function in this manner prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an
immunosuppressant. Moreover, the lack of costimulation may also be sufficient to anergize the T cells,
thereby inducing tolerance in a subject. Induction of long-term tolerance by B lymphocyte antigen-
blocking reagents may avoid the necessity of repeated administration of these blocking reagents. To
achieve sufficient immunosuppression or tolerance in a subject, it may also be necessary to block the
function of a combination of B lymphocyte antigens.

The efficacy of particular blocking reagents in preventing organ transplant rejection or GVHD
can be assessed using animal models that are predictive of efficacy in humans. Examples of appropriate
systems which can be used include allogeneic cardiac grafts in rats and xenogeneic pancreatic islet cell
grafts in mice, both of which have been used to examine the immunosuppressive effects of CTLA4Ig
fusion proteins in vivo as described in Lenschow et al., Science 257:789-792 (1992) and Turka et al.,
Proc. Natl. Acad. Sci. USA 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed.,
Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the
effect of blocking B lymphocyte antigen function in vivo on the development of that disease.

Blocking antigen function may also be therapeutically useful for treating autoimmune diseases.
Many autoimmune disorders are the result of inappropriate activation of T cells that are reactive against
self tissue and which promote the production of cytokines and autoantibodies involved in the pathology
of the diseases. Preventing the activation of autoreactive T cells may reduce or eliminate disease
symptoms. Administration of reagents which block costimulation of T cells by disrupting
receptor/ligand interactions of B lymphocyte antigens can be used to inhibit T cell activation and prevent
production of autoantibodies or T cell-derived cytokines which are potentially involved in the disease
process. Additionally, blocking reagents may induce antigen-specific tolerance of autoreactive T cells
which could lead to long-term relief from the disease. The efficacy of blocking reagents in preventing or
alleviating autoimmune disorders can be determined using a number of well-characterized animal
models of human autoimmune diseases. Examples include murine experimental autoimmune
encephalitis, systemic lupus erythematosus in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis
(see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856).

Upregulation of an antigen function (preferably a B lymphocyte antigen function), as a means of
upregulating immune responses, may also be useful in therapy. Upregulation of immune responses may
involve either enhancing an existing immune response or eliciting an initial immune response, as shown
by the following examples. For instance, enhancing an immune response through stimulating B
lymphocyte antigen function may be useful in cases of viral infection. In addition, systemic viral
diseases such as influenza, the common cold, and encephalitis might be alleviated by the systemic
administration of a stimulatory form of B lymphocyte antigens.

Alternatively, antiviral immune responses may be enhanced in an infected patient by removing
T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed APCs either
expressing the proteins or polypeptides described above or together with a stimulatory form of the
protein or polypeptide, and reintroducing the in vitro primed T cells into the patient. The infected cells
would now be capable of delivering a costimulatory signal to T cells in vivo, thereby activating the T
cells.

In another application, upregulation or enhancement of antigen function (preferably B
lymphocyte antigen function) may be useful in the induction of tumor immunity. Tumor cells (e.g.,
sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, carcinoma) transfected with one of the
above-described nucleic acids encoding a protein or polypeptide can be administered to a subject to
overcome tumor-specific tolerance in the subject. If desired, the tumor cell can be transfected to express
a combination of peptides. For example, tumor cells obtained from a patient can be transfected ex vivo
with an expression vector directing the expression of a peptide having B7-2-like activity alone, or in
conjunction with a peptide having B7-1-like activity and/or B7-3-like activity. The transfected tumor
cells are returned to the patient to result in expression of the peptides on the surface of the transfected
cell. Alternatively, gene therapy techniques can be used to target a tumor cell for transfection in vivo.

The presence of the protein or polypeptide encoded by the nucleic acids described above having
the activity of a B lymphocyte antigen(s) on the surface of the tumor cell provides the necessary
costimulation signal to T cells to induce a T cell mediated immune response against the transfected
tumor cells. In addition, tumor cells which lack or which fail to reexpress sufficient amounts of MHC
class I or MHC class II molecules can be transfected with nucleic acids encoding all or a portion (e.g.,
a cytoplasmic-domain truncated portion) of an MHC class I α chain and β2 microglobulin or an MHC
class II α chain and an MHC class II β chain to thereby express MHC class I or MHC class II proteins
on the cell surface, respectively. Expression of the appropriate MHC class I or class II molecules in
conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3)
induces a T cell mediated immune response against the transfected tumor cell. Optionally, a nucleic acid
encoding an antisense construct which blocks expression of an MHC class II associated protein, such as
the invariant chain, can also be cotransfected with a DNA encoding a protein or polypeptide having the
activity of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human subject
may be sufficient to overcome tumor-specific tolerance in the subject. Alternatively, as described in
more detail below, nucleic acids encoding these immune system regulator proteins or polypeptides or
nucleic acids regulating the expression of such proteins or polypeptides may be introduced into
appropriate host cells to increase or decrease the expression of the proteins as desired.

EXAMPLE 26 
Assaying the Expressed Proteins or Polypeptides for 
Hematopoiesis Regulating Activity 
The proteins or polypeptides encoded by the nucleic acids described above may also be
evaluated for their hematopoiesis regulating activity. For example, the effect of the proteins or
polypeptides on embryonic stem cell differentiation may be evaluated. Numerous assays for such
activity are familiar to those skilled in the art, including the assays disclosed in the following references:
Johansson et al., Cell. Biol. 15:141-151, 1995; Keller et al., Mol. Cell. Biol. 13:473-486, 1993;
McClanahan et al., Blood 81:2903-2915, 1993.

The proteins or polypeptides encoded by the nucleic acids described above may also be
evaluated for their influence on the lifetime of stem cells and stem cell differentiation. Numerous assays
for such activity are familiar to those skilled in the art, including the assays disclosed in the following
references: Freshney, M.G., Methylcellulose Colony Forming Assays, in Culture of Hematopoietic Cells,
R.J. Freshney, et al. Eds., pp. 265-268, Wiley-Liss, Inc., New York, NY, 1994; Hirayama et al., Proc.
Natl. Acad. Sci. USA 89:5907-5911, 1992; McNiece, I.K. and Briddell, R.A., Primitive Hematopoietic
Colony Forming Cells with High Proliferative Potential, in Culture of Hematopoietic Cells, supra;
Neben et al., Experimental Hematology 22:353-359, 1994; Ploemacher, R.E., Cobblestone Area Forming
Cell Assay, in Culture of Hematopoietic Cells, supra; Spooncer, E., Dexter, M. and Allen, T., Long Term
Bone Marrow Cultures in the Presence of Stromal Cells, in Culture of Hematopoietic Cells, supra; and
Sutherland, H.J., Long Term Culture Initiating Cell Assay, in Culture of Hematopoietic Cells, supra.

Those proteins or polypeptides which exhibit hematopoiesis regulatory activity may then be
formulated as pharmaceuticals and used to treat clinical conditions in which regulation of hematopoiesis
is beneficial. For example, a protein or polypeptide of the present invention may be useful in regulation
of hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even
marginal biological activity in support of colony forming cells or of factor-dependent cell lines indicates
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of erythroid
progenitor cells alone or in combination with other cytokines, thereby indicating utility, for example, in
treating various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the
production of erythroid precursors and/or erythroid cells; in supporting the growth and proliferation of
myeloid cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for
example, in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in
supporting the growth and proliferation of megakaryocytes and consequently of platelets, thereby
allowing prevention or treatment of various platelet disorders such as thrombocytopenia, and generally
for use in place of or complementary to platelet transfusions; and/or in supporting the growth and
proliferation of hematopoietic stem cells which are capable of maturing to any and all of the above-
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such
as those usually treated with transplantation, including, without limitation, aplastic anemia and paroxysmal
nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment post
irradiation/chemotherapy, either in vivo or ex vivo (i.e., in conjunction with bone marrow
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) as
normal cells or genetically manipulated for gene therapy. Alternatively, as described in more detail
below, nucleic acids encoding these proteins or polypeptides or nucleic acids regulating the expression
of these proteins or polypeptides may be introduced into appropriate host cells to increase or decrease the
expression of the proteins as desired.

EXAMPLE 27

Assaying the Expressed Proteins or Polypeptides
for Regulation of Tissue Growth
The proteins or polypeptides encoded by the nucleic acids described above may also be
evaluated for their effect on tissue growth. Numerous assays for such activity are familiar to those
skilled in the art, including the assays disclosed in International Patent Publication No. WO95/16035,
International Patent Publication No. WO95/05846 and International Patent Publication No.
WO91/07491.

Assays for wound healing activity include, without limitation, those described in: Winter,
Epidermal Wound Healing, pps. 71-112 (Maibach, H.I. and Rovee, D.T., eds.), Year Book Medical
Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol. 71:382-84 (1978).

Those proteins or polypeptides which are involved in the regulation of tissue growth may then
be formulated as pharmaceuticals and used to treat clinical conditions in which regulation of tissue
growth is beneficial. For example, a protein or polypeptide may have utility in compositions used for
bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as for wound
healing and tissue repair and replacement, and in the treatment of burns, incisions and ulcers.

A protein or polypeptide encoded by the nucleic acids described above which induces cartilage
and/or bone growth in circumstances where bone is not normally formed has application in the healing
of bone fractures and cartilage damage or defects in humans and other animals. Such a preparation
employing a protein or polypeptide of the invention may have prophylactic use in closed as well as open
fracture reduction and also in the improved fixation of artificial joints. De novo bone synthesis induced
by an osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic resection
induced craniofacial defects, and also is useful in cosmetic plastic surgery.

A protein or polypeptide of this invention may also be used in the treatment of periodontal
disease, and in other tooth repair processes. Such agents may provide an environment to attract bone-
forming cells, stimulate growth of bone-forming cells or induce differentiation of progenitors of bone-
forming cells. A protein of the invention may also be useful in the treatment of osteoporosis or
osteoarthritis, such as through stimulation of bone and/or cartilage repair or by blocking inflammation or
processes of tissue destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory
processes.

Another category of tissue regeneration activity that may be attributable to the proteins or
polypeptides encoded by the nucleic acids described above is tendon/ligament formation. A protein or
polypeptide encoded by the nucleic acids described above, which induces tendon/ligament-like tissue or
other tissue formation in circumstances where such tissue is not normally formed, has application in the
healing of tendon or ligament tears, deformities and other tendon or ligament defects in humans and
other animals. Such a preparation employing a tendon/ligament-like tissue inducing protein may have
prophylactic use in preventing damage to tendon or ligament tissue, as well as use in the improved
fixation of tendon or ligament to bone or other tissues, and in repairing defects to tendon or ligament
tissue. De novo tendon/ligament-like tissue formation induced by a protein or polypeptide of the present
invention contributes to the repair of tendon or ligament defects of congenital, traumatic or other origin
and is also useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The
proteins or polypeptides of the present invention may provide an environment to attract tendon- or
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce differentiation of
progenitors of tendon- or ligament-forming cells, or induce growth of tendon/ligament cells or
progenitors ex vivo for return in vivo to effect tissue repair. The proteins or polypeptides of the
invention may also be useful in the treatment of tendinitis, carpal tunnel syndrome and other tendon or
ligament defects. The therapeutic compositions may also include an appropriate matrix and/or
sequestering agent as a carrier as is well known in the art.

The proteins or polypeptides of the present invention may also be useful for proliferation of
neural cells and for regeneration of nerve and brain tissue, i.e., for the treatment of central and peripheral
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which involve
degeneration, death or trauma to neural cells or nerve tissue. More specifically, a protein or polypeptide
may be used in the treatment of diseases of the peripheral nervous system, such as peripheral nerve
injuries, peripheral neuropathy and localized neuropathies, and central nervous system diseases, such as
Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager
syndrome. Further conditions which may be treated in accordance with the present invention include
mechanical and traumatic disorders, such as spinal cord disorders, head trauma and cerebrovascular
diseases such as stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies
may also be treatable using a protein or polypeptide of the invention.

Proteins or polypeptides of the invention may also be useful to promote better or faster closure
of non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular
insufficiency, surgical and traumatic wounds, and the like.

It is expected that a protein or polypeptide of the present invention may also exhibit activity for
generation or regeneration of other tissues, such as organs (including, for example, pancreas, liver,
intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including
vascular endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the
desired effects may be by inhibition or modulation of fibrotic scarring to allow normal tissue to generate.
A protein or polypeptide of the invention may also exhibit angiogenic activity.

A protein or polypeptide of the present invention may also be useful for gut protection or
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and conditions
resulting from systemic cytokine damage.

A protein or polypeptide of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the growth of 
tissues described above. 

Alternatively, as described in more detail below, nucleic acids encoding tissue growth regulating
activity proteins or polypeptides or nucleic acids regulating the expression of such proteins or
polypeptides may be introduced into appropriate host cells to increase or decrease the expression of the
proteins as desired.



EXAMPLE 28

Assaying the Expressed Proteins or Polypeptides for
Regulation of Reproductive Hormones
The proteins or polypeptides of the present invention may also be evaluated for their ability to
regulate reproductive hormones, such as follicle stimulating hormone. Numerous assays for such
activity are familiar to those skilled in the art, including the assays disclosed in the following references:
Vale et al., Endocrinol. 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., Nature
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. USA
83:3091-3095, 1986; Chapter 6.12 in Current Protocols in Immunology, J.E. Coligan et al. Eds., Greene
Publishing Associates and Wiley-Interscience; Taub et al., J. Clin. Invest. 95:1370-1376, 1995; Lind et al.,
APMIS 103:140-146, 1995; Muller et al., Eur. J. Immunol. 25:1744-1748; Gruber et al., J. Immunol.
152:5860-5867, 1994; Johnston et al., J. Immunol. 153:1762-1768, 1994.

Those proteins or polypeptides which exhibit activity as reproductive hormones or regulators of
cell movement may then be formulated as pharmaceuticals and used to treat clinical conditions in which
regulation of reproductive hormones is beneficial. For example, a protein or polypeptide may exhibit
activin- or inhibin-related activities. Inhibins are characterized by their ability to inhibit the release of
follicle stimulating hormone (FSH), while activins are characterized by their ability to stimulate the
release of FSH. Thus, a protein or polypeptide of the present invention, alone or in heterodimers with a
member of the inhibin family, may be useful as a contraceptive based on the ability of inhibins to
decrease fertility in female mammals and decrease spermatogenesis in male mammals. Administration
of sufficient amounts of other inhibins can induce infertility in these mammals. Alternatively, the
protein or polypeptide of the invention, as a homodimer or as a heterodimer with other protein subunits
of the inhibin-β group, may be useful as a fertility inducing therapeutic, based upon the ability of activin
molecules to stimulate FSH release from cells of the anterior pituitary. See, for example, United States
Patent 4,798,885. A protein or polypeptide of the invention may also be useful for advancement of the
onset of fertility in sexually immature mammals, so as to increase the lifetime reproductive performance
of domestic animals such as cows, sheep and pigs.

Alternatively, as described in more detail below, nucleic acids encoding reproductive hormone
regulating activity proteins or polypeptides or nucleic acids regulating the expression of such proteins or
polypeptides may be introduced into appropriate host cells to increase or decrease the expression of the
proteins or polypeptides as desired.



EXAMPLE 29

Assaying the Expressed Proteins or Polypeptides for Chemotactic/Chemokinetic Activity
The proteins or polypeptides of the present invention may also be evaluated for
chemotactic/chemokinetic activity. For example, a protein or polypeptide of the present invention may
have chemotactic or chemokinetic activity (e.g., act as a chemokine) for mammalian cells, including, for
example, monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or
endothelial cells. Chemotactic and chemokinetic proteins or polypeptides can be used to mobilize or
attract a desired cell population to a desired site of action. Chemotactic or chemokinetic proteins or
polypeptides provide particular advantages in treatment of wounds and other trauma to tissues, as well as
in treatment of localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to
tumors or sites of infection may result in improved immune responses against the tumor or infecting
agent.

A protein or polypeptide has chemotactic activity for a particular cell population if it can
stimulate, directly or indirectly, the directed orientation or movement of such cell population.
Preferably, the protein or polypeptide has the ability to directly stimulate directed movement of cells.
Whether a particular protein or polypeptide has chemotactic activity for a population of cells can be
readily determined by employing such protein or polypeptide in any known assay for cell chemotaxis.

The activity of a protein or polypeptide of the invention may, among other means, be measured
by the following methods:

Assays for chemotactic activity (which will identify proteins or polypeptides that induce or
prevent chemotaxis) consist of assays that measure the ability of a protein or polypeptide to induce the
migration of cells across a membrane as well as the ability of a protein or polypeptide to induce the
adhesion of one cell population to another cell population. Suitable assays for movement and adhesion
include, without limitation, those described in: Current Protocols in Immunology, Ed. by J.E. Coligan,
A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and
Wiley-Interscience, Chapter 6.12: 6.12.1-6.12.28; Taub et al., J. Clin. Invest. 95:1370-1376, 1995; Lind et
al., APMIS 103:140-146, 1995; Mueller et al., Eur. J. Immunol. 25:1744-1748; Gruber et al., J. Immunol.
152:5860-5867, 1994; Johnston et al., J. Immunol. 153:1762-1768, 1994.

EXAMPLE 30

Assaying the Expressed Proteins or Polypeptides for Regulation of Blood Clotting 
The proteins or polypeptides of the present invention may also be evaluated for their effects on
blood clotting. Numerous assays for such activity are familiar to those skilled in the art, including the
assays disclosed in the following references: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick
et al., Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub,
Prostaglandins 35:467-474, 1988.

Those proteins or polypeptides which are involved in the regulation of blood clotting may then
be formulated as pharmaceuticals and used to treat clinical conditions in which regulation of blood
clotting is beneficial. For example, a protein or polypeptide of the invention may also exhibit hemostatic
or thrombolytic activity. As a result, such a protein or polypeptide is expected to be useful in treatment
of various coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance
coagulation and other hemostatic events in treating wounds resulting from trauma, surgery or other
causes. A protein or polypeptide of the invention may also be useful for dissolving or inhibiting
formation of thromboses and for treatment and prevention of conditions resulting therefrom (such as
infarction of cardiac and central nervous system vessels (e.g., stroke)). Alternatively, as described in
more detail below, nucleic acids encoding blood clotting activity proteins or polypeptides or nucleic
acids regulating the expression of such proteins or polypeptides may be introduced into appropriate host
cells to increase or decrease the expression of the proteins or polypeptides as desired.

EXAMPLE 31

Assaying the Expressed Proteins or Polypeptides for Involvement in
Receptor/Ligand Interactions
The proteins or polypeptides of the present invention may also be evaluated for their
involvement in receptor/ligand interactions. Numerous assays for such involvement are familiar to those
skilled in the art, including the assays disclosed in the following references: Chapter 7 (7.28.1-7.28.22) in
Current Protocols in Immunology, J.E. Coligan et al., Eds., Greene Publishing Associates and
Wiley-Interscience; Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med.
168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160, 1989; Stoltenborg et al., J. Immunol.
Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995; Gyuris et al., Cell 75:791-803, 1993.

For example, the proteins or polypeptides of the present invention may also demonstrate activity
as receptors, receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of such
receptors and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases
and their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions and
their ligands (including without limitation, cellular adhesion molecules (such as selectins, integrins and
their ligands) and receptor/ligand pairs involved in antigen presentation, antigen recognition and
development of cellular and humoral immune responses). Receptors and ligands are also useful for
screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction. A
protein or polypeptide of the present invention (including, without limitation, fragments of receptors and
ligands) may be useful as inhibitors of receptor/ligand interactions. Alternatively, as described in more
detail below, nucleic acids encoding proteins or polypeptides involved in receptor/ligand interactions or
nucleic acids regulating the expression of such proteins or polypeptides may be introduced into
appropriate host cells to increase or decrease the expression of the proteins or polypeptides as desired.

EXAMPLE 32 

Assaying the Proteins or Polypeptides for Anti-Inflammatory Activity

The proteins or polypeptides of the present invention may also be evaluated for anti-inflammatory
activity. The anti-inflammatory activity may be achieved by providing a stimulus to
cells involved in the inflammatory response, by inhibiting or promoting cell-cell interactions (such
as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the
inflammatory process, by inhibiting or promoting cell extravasation, or by stimulating or suppressing
production of other factors which more directly inhibit or promote an inflammatory response.
Proteins or polypeptides exhibiting such activities can be used to treat inflammatory conditions
including chronic or acute conditions, including without limitation inflammation associated with
infection (such as septic shock, sepsis or systemic inflammatory response syndrome), ischemia-reperfusion
injury, endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis,
cytokine- or chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or
resulting from over production of cytokines such as TNF or IL-1. Proteins or polypeptides of the
invention may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or
material. Alternatively, as described in more detail below, nucleic acids encoding anti-inflammatory
activity proteins or polypeptides or nucleic acids regulating the expression of such proteins or
polypeptides may be introduced into appropriate host cells to increase or decrease the expression of the
proteins or polypeptides as desired.

EXAMPLE 33 

Assaying the Expressed Proteins or Polypeptides for Tumor Inhibition Activity
The proteins or polypeptides of the present invention may also be evaluated for tumor inhibition
activity. In addition to the activities described above for immunological treatment or prevention of
tumors, a protein or polypeptide of the invention may exhibit other anti-tumor activities. A protein or
polypeptide may inhibit tumor growth directly or indirectly (such as, for example, via ADCC). A
protein or polypeptide may exhibit its tumor inhibitory activity by acting on tumor tissue or tumor
precursor tissue, by inhibiting formation of tissues necessary to support tumor growth (such as, for
example, by inhibiting angiogenesis), by causing production of other factors, agents or cell types which
inhibit tumor growth, or by suppressing, eliminating or inhibiting factors, agents or cell types which
promote tumor growth. Alternatively, as described in more detail below, nucleic acids encoding
proteins or polypeptides with tumor inhibition activity or nucleic acids regulating the expression of such
proteins or polypeptides may be introduced into appropriate host cells to increase or decrease the
expression of the proteins or polypeptides as desired.

A protein or polypeptide of the invention may also exhibit one or more of the following
additional activities or effects: inhibiting the growth, infection or function of, or killing, infectious
agents, including, without limitation, bacteria, viruses, fungi and other parasites; affecting (suppressing
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye color,
skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape (such as, for
example, breast augmentation or diminution, change in bone form or shape); affecting biorhythms or
circadian cycles or rhythms; affecting the fertility of male or female subjects; affecting the metabolism,
catabolism, anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein,
carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); affecting
behavioral characteristics, including, without limitation, appetite, libido, stress, cognition (including
cognitive disorders), depression (including depressive disorders) and violent behaviors; providing
analgesic effects or other pain reducing effects; promoting differentiation and growth of embryonic stem
cells in lineages other than hematopoietic lineages; hormonal or endocrine activity; in the case of
enzymes, correcting deficiencies of the enzyme and treating deficiency-related diseases; treatment of
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for
example, the ability to bind antigens or complement); and the ability to act as an antigen in a vaccine
composition to raise an immune response against such protein or another material or entity which is
cross-reactive with such protein. Alternatively, as described in more detail below, nucleic acids
encoding proteins or polypeptides involved in any of the above mentioned activities or nucleic acids
regulating the expression of such proteins may be introduced into appropriate host cells to increase or
decrease the expression of the proteins or polypeptides as desired.

EXAMPLE 34 

Identification of Proteins or Polypeptides which Interact with
Proteins or Polypeptides of the Present Invention
Proteins or polypeptides which interact with the proteins or polypeptides of the present
invention, such as receptor proteins, may be identified using two hybrid systems such as the Matchmaker
Two Hybrid System 2 (Catalog No. K1604-1, Clontech). As described in the manual accompanying the
kit, nucleic acids encoding the proteins or polypeptides of the present invention are inserted into an
expression vector such that they are in frame with DNA encoding the DNA binding domain of the yeast
transcriptional activator GAL4. cDNAs in a cDNA library which encode proteins or polypeptides which
might interact with the proteins or polypeptides of the present invention are inserted into a second
expression vector such that they are in frame with DNA encoding the activation domain of GAL4. The
two expression plasmids are transformed into yeast and the yeast are plated on selection medium which
selects for expression of selectable markers on each of the expression vectors as well as GAL4
dependent expression of the HIS3 gene. Transformants capable of growing on medium lacking histidine
are screened for GAL4 dependent lacZ expression. Those cells which are positive in both the histidine
selection and the lacZ assay contain plasmids encoding proteins or polypeptides which interact with the
proteins or polypeptides of the present invention.
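
The scoring rule at the end of the two hybrid screen can be sketched in a few lines of Python. This is a minimal illustration only and is not taken from the kit manual: the `Transformant` record and the clone names are hypothetical, and the only rule encoded is the one stated above, namely that a clone is retained when it is positive in both the histidine selection and the lacZ assay.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Transformant:
    """One yeast transformant from the screen (hypothetical record layout)."""
    clone_id: str
    grows_without_histidine: bool  # GAL4 dependent HIS3 expression
    lacz_positive: bool            # GAL4 dependent lacZ expression


def candidate_interactors(transformants):
    """Return IDs of clones positive in both the HIS3 selection and the lacZ assay."""
    return [
        t.clone_id
        for t in transformants
        if t.grows_without_histidine and t.lacz_positive
    ]


# Illustrative data only; these clone names are made up.
screen = [
    Transformant("clone_A", True, True),
    Transformant("clone_B", True, False),
    Transformant("clone_C", False, False),
    Transformant("clone_D", True, True),
]
print(candidate_interactors(screen))
```

Requiring both reporters is the design point: a clone that grows without histidine but is lacZ negative is treated as a likely false positive and is dropped.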

Alternatively, the system described in Lustig et al., Methods in Enzymology 283: 83-99 (1997)
may be used for identifying molecules which interact with the proteins or polypeptides of the present
invention. In such systems, in vitro transcription reactions are performed on a pool of vectors containing
nucleic acid inserts which encode the proteins or polypeptides of the present invention. The nucleic acid
inserts are cloned downstream of a promoter which drives in vitro transcription. The resulting pools of
mRNAs are introduced into Xenopus laevis oocytes. The oocytes are then assayed for a desired activity.
Alternatively, the pooled in vitro transcription products produced as described above may be
translated in vitro. The pooled in vitro translation products can be assayed for a desired activity or for
interaction with a known protein or polypeptide.
Proteins, polypeptides or other molecules interacting with proteins or polypeptides of the
present invention can be found by a variety of additional techniques. In one method, affinity
columns containing the protein or polypeptide of the present invention can be constructed. In some
versions of this method, the affinity column contains chimeric proteins in which the protein or
polypeptide of the present invention is fused to glutathione S-transferase. A mixture of cellular
proteins or pool of expressed proteins as described above is applied to the affinity column.
Molecules interacting with the protein or polypeptide attached to the column can then be isolated and
analyzed on 2-D electrophoresis gel as described in Ramunsen et al., Electrophoresis, 18, 588-598
(1997). Alternatively, the molecules retained on the affinity column can be purified by
electrophoresis based methods and sequenced. The same method can be used to isolate antibodies, to
screen phage display products, or to screen phage display human antibodies.

Molecules interacting with the proteins or polypeptides of the present invention can also be
screened by using an Optical Biosensor as described in Edwards & Leatherbarrow, Analytical
Biochemistry, 246, 1-6 (1997). The main advantage of the method is that it allows the determination
of the association rate between the protein or polypeptide and other interacting molecules. Thus, it is
possible to specifically select interacting molecules with a high or low association rate. Typically a
target molecule is linked to the sensor surface (through a carboxymethyl dextran matrix) and a sample
of test molecules is placed in contact with the target molecules. The binding of a test molecule to the
target molecule causes a change in the refractive index and/or thickness. This change is detected by
the Biosensor provided it occurs in the evanescent field (which extends a few hundred nanometers
from the sensor surface). In these screening assays, the target molecule can be one of the proteins or
polypeptides of the present invention and the test sample can be a collection of proteins, polypeptides
or other molecules extracted from tissues or cells, a pool of expressed proteins, combinatorial peptide
and/or chemical libraries, or phage displayed peptides. The tissues or cells from which the test
molecules are extracted can originate from any species.

In other methods, a target protein or polypeptide is immobilized and the test population is a 
collection of unique proteins or polypeptides of the present invention. 

To study the interaction of the proteins or polypeptides of the present invention with drugs,
the microdialysis coupled to HPLC method described by Wang et al., Chromatographia, 44, 205-208
(1997) or the affinity capillary electrophoresis method described by Busch et al., J. Chromatogr.,
777:311-328 (1997) can be used.

The system described in U.S. Patent No. 5,654,150 may also be used to identify molecules 
which interact with the proteins or polypeptides of the present invention. In this system, pools of nucleic 
acids encoding the proteins or polypeptides of the present invention are transcribed and translated in 
vitro and the reaction products are assayed for interaction with a known polypeptide or antibody.

It will be appreciated by those skilled in the art that the proteins or polypeptides of the present 
invention may be assayed for numerous activities in addition to those specifically enumerated above. 
For example, the expressed proteins or polypeptides may be evaluated for applications involving control
and regulation of inflammation, tumor proliferation or metastasis, infection, or other clinical conditions. 
In addition, the proteins or polypeptides may be useful as nutritional agents or cosmetic agents. 

The proteins or polypeptides of the present invention may be used to generate antibodies
capable of specifically binding to the proteins or polypeptides of the present invention. The
antibodies may be monoclonal antibodies or polyclonal antibodies. As used herein, "antibody" refers
to a polypeptide or group of polypeptides which are comprised of at least one binding domain, where
a binding domain is formed from the folding of variable domains of an antibody molecule to form
three-dimensional binding spaces with an internal surface shape and charge distribution
complementary to the features of an antigenic determinant of an antigen, which allows an
immunological reaction with the antigen. Antibodies include recombinant proteins comprising the
binding domains, as well as fragments, including Fab, Fab', F(ab)2, and F(ab')2 fragments.

As used herein, an "antigenic determinant" is the portion of an antigen molecule that
determines the specificity of the antigen-antibody reaction. An "epitope" refers to an antigenic
determinant of a polypeptide. An epitope can comprise as few as 3 amino acids in a spatial
conformation which is unique to the epitope. Generally an epitope consists of at least 6 such amino
acids, and more usually at least 8-10 such amino acids. Methods for determining the amino acids
which make up an epitope include x-ray crystallography, 2-dimensional nuclear magnetic resonance,
and epitope mapping, e.g. the Pepscan method described by H. Mario Geysen et al., 1984, Proc. Natl.
Acad. Sci. U.S.A. 81:3998-4002; PCT Publication No. WO 84/03564; and PCT Publication No. WO
84/03506.
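
As a rough illustration of how a peptide-scanning approach to epitope mapping can be set up, the sketch below enumerates every overlapping peptide of a fixed length from a polypeptide sequence. It is a simplified sketch under stated assumptions, not a description of the cited method: the function name, the default window of 6 residues (chosen to match the "at least 6 such amino acids" figure above), and the example sequence are all hypothetical.

```python
def overlapping_peptides(sequence, length=6, step=1):
    """Return (start_position, peptide) pairs for every window of `length` residues.

    Start positions are 1-based, as is conventional for protein sequences.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    return [
        (start + 1, sequence[start:start + length])
        for start in range(0, len(sequence) - length + 1, step)
    ]


# Made-up 10-residue sequence used purely for illustration.
for position, peptide in overlapping_peptides("MKTAYIAKQR"):
    print(position, peptide)
```

Each peptide in the resulting list would then be tested for antibody binding; residues shared by the reactive peptides indicate where the epitope lies. Note that a linear scan of this kind cannot detect epitopes formed by residues that are distant in the sequence but adjacent in the folded protein.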

In some embodiments, the antibodies may be capable of specifically binding to a protein or 
polypeptide encoded by EST-related nucleic acids, fragments of EST-related nucleic acids, positional 
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids. 
In some embodiments, the antibody may be capable of binding an antigenic determinant or an epitope in
a protein or polypeptide encoded by EST-related nucleic acids, fragments of EST-related nucleic acids, 
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related 
nucleic acids. 

In other embodiments, the antibodies may be capable of specifically binding to an EST-related 
polypeptide, fragment of an EST-related polypeptide, positional segment of an EST-related polypeptide
or fragment of a positional segment of an EST-related polypeptide. In some embodiments, the antibody 
may be capable of binding an antigenic determinant or an epitope in an EST-related polypeptide, 
fragment of an EST-related polypeptide, positional segment of an EST-related polypeptide or fragment 
of a positional segment of an EST-related polypeptide. 
In the case of secreted proteins, the antibodies may be capable of binding a full-length protein
encoded by a nucleic acid of the present invention, a mature protein (i.e. the protein generated by
cleavage of the signal peptide) encoded by a nucleic acid of the present invention, or a signal peptide
encoded by a nucleic acid of the present invention.



EXAMPLE 35 

Production of an Antibody to a Human Polypeptide or Protein

The above described EST-related nucleic acids, fragments of EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids or nucleic acids encoding EST-related polypeptides, fragments of EST-related
polypeptides, positional segments of EST-related polypeptides or fragments of positional segments of
EST-related polypeptides are operably linked to promoters and introduced into cells as described above.

In the case of secreted proteins, nucleic acids encoding the full protein (i.e. the mature protein 
and the signal peptide), nucleic acids encoding the mature protein (i.e. the protein generated by cleavage
of the signal peptide), or nucleic acids encoding the signal peptide are operably linked to promoters and 
introduced into cells as described above. 

The encoded proteins or polypeptides are then substantially purified or isolated as described
above. The concentration of protein in the final preparation is adjusted, for example, by concentration on
an Amicon filter device, to the level of a few µg/ml. Monoclonal or polyclonal antibody to the protein or
polypeptide can then be prepared as follows:
1. Monoclonal Antibody Production by Hybridoma Fusion

Monoclonal antibody to epitopes of any of the proteins or polypeptides identified and isolated as
described can be prepared from murine hybridomas according to the classical method of Kohler and
Milstein, Nature 256:495 (1975) or derivative methods thereof. Briefly, a mouse is repetitively
inoculated with a few micrograms of the selected protein or peptides derived therefrom over a period of
a few weeks. The mouse is then sacrificed, and the antibody producing cells of the spleen isolated. The
spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess
unfused cells destroyed by growth of the system on selective media comprising aminopterin (HAT
media). The successfully fused cells are diluted and aliquots of the dilution placed in wells of a
microtiter plate where growth of the culture is continued. Antibody-producing clones are identified by
detection of antibody in the supernatant fluid of the wells by immunoassay procedures, such as ELISA, as
originally described by Engvall, Meth. Enzymol. 70:419 (1980). Selected positive clones can be
expanded and their monoclonal antibody product harvested for use. Detailed procedures for monoclonal
antibody production are described in Davis, L. et al., in Basic Methods in Molecular Biology, Elsevier,
New York, Section 21-2.

2. Polyclonal Antibody Production by Immunization
Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein or
polypeptide can be prepared by immunizing suitable animals with the expressed protein or peptides
derived therefrom, which can be unmodified or modified to enhance immunogenicity. Effective
polyclonal antibody production is affected by many factors related both to the antigen and the host
species. For example, small molecules tend to be less immunogenic than others and may require the use
of carriers and adjuvant. Also, host animals' responses vary depending on site of inoculations and doses,
with both inadequate and excessive doses of antigen resulting in low titer antisera. Small doses (ng level)
of antigen administered at multiple intradermal sites appear to be most reliable. An effective
immunization protocol for rabbits can be found in Vaitukaitis et al., J. Clin. Endocrinol. Metab. 33:988-991
(1971).

Booster injections can be given at regular intervals, and antiserum harvested when antibody titer
thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar against
known concentrations of the antigen, begins to fall. See, for example, Ouchterlony et al., Chap. 19 in:
Handbook of Experimental Immunology, D. Wier (ed.), Blackwell (1973). Plateau concentration of
antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 µM). Affinity of the antisera for
the antigen is determined by preparing competitive binding curves, as described, for example, by Fisher,
D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For
Microbiol., Washington, D.C. (1980).

Antibody preparations prepared according to either of the above protocols are useful in a variety
of contexts. In particular, the antibodies may be used in immunoaffinity chromatography techniques
such as those described below to facilitate large scale isolation, purification, or enrichment of the
proteins or polypeptides encoded by EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids or for the isolation,
purification or enrichment of EST-related polypeptides, fragments of EST-related polypeptides,
positional segments of EST-related polypeptides or fragments of positional segments of EST-related
polypeptides.

In the case of secreted proteins, the antibodies may be used for the isolation, purification, or
enrichment of the full protein (i.e. the mature protein and the signal peptide), the mature protein (i.e. the
protein generated by cleavage of the signal peptide), or the signal peptide.

Additionally, the antibodies may be used in immunoaffinity chromatography techniques such as
those described below to isolate, purify, or enrich polypeptides which have been linked to the proteins or
polypeptides encoded by EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids or to isolate, purify, or enrich EST-related
polypeptides, fragments of EST-related polypeptides, positional segments of EST-related polypeptides
or fragments of positional segments of EST-related polypeptides.

The antibodies may also be used to determine the cellular localization of the proteins or
polypeptides encoded by EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids or the cellular
localization of EST-related polypeptides, fragments of EST-related polypeptides, positional segments of
EST-related polypeptides or fragments of positional segments of EST-related polypeptides.

In addition, the antibodies may also be used to determine the cellular localization of
polypeptides which have been linked to the proteins or polypeptides encoded by EST-related nucleic
acids, positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids or polypeptides which have been linked to EST-related polypeptides, fragments of
EST-related polypeptides, positional segments of EST-related polypeptides or fragments of positional
segments of EST-related polypeptides.

The antibodies may also be used in quantitative immunoassays which determine concentrations
of antigen-bearing substances in biological samples; they may also be used semi-quantitatively or
qualitatively to identify the presence of antigen in a biological sample or to identify the type of tissue
present in a biological sample. The antibodies may also be used in therapeutic compositions for killing
cells expressing the protein or reducing the levels of the protein in the body.

VI. Use of 5' ESTs or Consensus Contigated 5' ESTs or Sequences Obtainable Therefrom or
Portions Thereof as Reagents

The EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of 
positional segments of EST-related nucleic acids may be used as reagents in isolation procedures, 
diagnostic assays, and forensic procedures. For example, sequences from the EST-related nucleic acids, 
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids, may be detectably labeled and used as probes to isolate other sequences capable of 
hybridizing to them. In addition, the EST-related nucleic acids, positional segments of EST-related 
nucleic acids or fragments of positional segments of EST-related nucleic acids may be used to design 
PCR primers to be used in isolation, diagnostic, or forensic procedures. 

1. Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of 
positional segments of EST-related nucleic acids in isolation, diagnostic and forensic procedures 

EXAMPLE 36 
Preparation of PCR Primers and Amplification of DNA 

The EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids may be used to prepare PCR primers for a variety of
applications, including isolation procedures for cloning nucleic acids capable of hybridizing to such
sequences, diagnostic techniques and forensic techniques. In some embodiments, the PCR primers are at
least 10, 15, 18, 20, 23, 25, 28, 30, 40, or 50 nucleotides in length. In some embodiments, the PCR
primers may be more than 30 bases in length. It is preferred that the primer pairs have approximately the
same G/C ratio, so that melting temperatures are approximately the same. A variety of PCR techniques
are familiar to those skilled in the art. For a review of PCR technology, see Molecular Cloning to
Genetic Engineering, White, B.A., Ed., in Methods in Molecular Biology 67, Humana Press, Totowa, 1997.
In each of these PCR procedures, PCR primers on either side of the nucleic acid sequences to be
amplified are added to a suitably prepared nucleic acid sample along with dNTPs and a thermostable
polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase. The nucleic acid in the
sample is denatured and the PCR primers are specifically hybridized to complementary nucleic acid
sequences in the sample. The hybridized primers are extended. Thereafter, another cycle of
denaturation, hybridization, and extension is initiated. The cycles are repeated multiple times to produce
an amplified fragment containing the nucleic acid sequence between the primer sites.
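
The preference for primer pairs with similar G/C ratios and melting temperatures can be checked with a short calculation. The sketch below is illustrative only and rests on stated assumptions: it estimates melting temperature with the simple rule of 2 degrees C per A or T and 4 degrees C per G or C, a rough approximation generally applied to short oligonucleotides, and the tolerance values and example primer sequences are hypothetical rather than taken from this specification.

```python
def base_counts(primer):
    """Return (A+T count, G+C count) after validating the primer sequence."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    gc = seq.count("G") + seq.count("C")
    return len(seq) - gc, gc


def gc_fraction(primer):
    """Fraction of bases in the primer that are G or C."""
    at, gc = base_counts(primer)
    return gc / (at + gc)


def rough_tm(primer):
    """Approximate melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    at, gc = base_counts(primer)
    return 2 * at + 4 * gc


def primers_compatible(forward, reverse, max_gc_diff=0.10, max_tm_diff=5):
    """True when the pair has similar G/C fraction and similar estimated Tm.

    The default tolerances are assumptions chosen for illustration.
    """
    gc_ok = abs(gc_fraction(forward) - gc_fraction(reverse)) <= max_gc_diff
    tm_ok = abs(rough_tm(forward) - rough_tm(reverse)) <= max_tm_diff
    return gc_ok and tm_ok


# Made-up 20-nucleotide primers, for illustration only.
forward = "AGCTGACCTGAAGCTGATCC"
reverse = "TGCAGGTCATCGAGGACTTC"
print(gc_fraction(forward), rough_tm(forward))
print(primers_compatible(forward, reverse))
```

For longer primers, such as the 40 or 50 nucleotide primers mentioned above, a nearest-neighbor thermodynamic estimate would be more appropriate than this simple rule.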



EXAMPLE 37

Use of the EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids as probes
Probes derived from EST-related nucleic acids, positional segments of EST-related nucleic acids
or fragments of positional segments of EST-related nucleic acids may be labeled with detectable labels
familiar to those skilled in the art, including radioisotopes and non-radioactive labels, to provide a
detectable probe. The detectable probe may be single stranded or double stranded and may be made
using techniques known in the art, including in vitro transcription, nick translation, or kinase reactions.
A nucleic acid sample containing a sequence capable of hybridizing to the labeled probe is contacted
with the labeled probe. If the nucleic acid in the sample is double stranded, it may be denatured prior to
contacting the probe. In some applications, the nucleic acid sample may be immobilized on a surface
such as a nitrocellulose or nylon membrane. The nucleic acid sample may comprise nucleic acids
obtained from a variety of sources, including genomic DNA, cDNA libraries, RNA, or tissue samples.

Procedures used to detect the presence of nucleic acids capable of hybridizing to the detectable
probe include well known techniques such as Southern blotting, Northern blotting, dot blotting, colony
hybridization, and plaque hybridization. In some applications, the nucleic acid capable of hybridizing to
the labeled probe may be cloned into vectors such as expression vectors, sequencing vectors, or in vitro
transcription vectors to facilitate the characterization and expression of the hybridizing nucleic acids in
the sample. For example, such techniques may be used to isolate and clone sequences in a genomic
library or cDNA library which are capable of hybridizing to the detectable probe as described in
Example 20 above.

PCR primers made as described in Example 36 above may be used in forensic analyses, such as
the DNA fingerprinting techniques described in Examples 38-42 below. Such analyses may utilize
detectable probes or primers based on the sequences of the EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids.



EXAMPLE 38 

Forensic Matching by DNA Sequencing
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In one exemplary method, DNA samples are isolated from forensic specimens of, for example, 
hair, semen, blood or skin cells by conventional methods. A panel of PGR primers based on a number of 
the EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of 
positional segments of EST-related nucleic acids is then utilized in accordance with Example 36 to 
5 amplify DNA of approximately 100-200 bases in length from the forensic specimen. Corresponding 
sequences are obtained from a test subject. Each of these identification DNAs is then sequenced using 
standard techniques, and a simple database comparison determines the differences, if any, between the 
sequences from the subject and those from the sample. Statistically significant differences between the 
suspect's DNA sequences and those from the sample conclusively prove a lack of identity. This lack of 
1 0 identity can be proven, for example, with only one sequence. Identity, on the other hand, should be 
demonstrated with a large number of sequences, all matching. Preferably, a minimum of 50 statistically 
identical sequences of 100 bases in length are used to prove identity between the suspect and the sample. 
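
The database comparison described above can be sketched as follows. This is a minimal illustration only: the function name `compare_panels`, the locus labels, and the use of exact string comparison are assumptions made for the sketch (a practical comparison would also have to allow for sequencing error before calling a difference significant). The default of 50 loci follows the preferred minimum stated above.

```python
def compare_panels(subject, sample, min_loci=50):
    """Compare per-locus sequences from a test subject and a forensic sample.

    subject, sample: dicts mapping a locus label (hypothetical names) to the
    sequence obtained for that locus.
    Returns a (verdict, mismatched_loci) tuple.
    """
    shared = sorted(set(subject) & set(sample))
    mismatched = [locus for locus in shared
                  if subject[locus].upper() != sample[locus].upper()]
    if mismatched:
        # A single differing sequence is enough to exclude identity.
        return "excluded", mismatched
    if len(shared) >= min_loci:
        # Identity is only asserted when a large panel matches throughout.
        return "consistent with identity", []
    return "inconclusive: too few loci", []
```

A mismatch at any one locus returns an exclusion, whereas a match is reported only when the whole panel agrees, mirroring the asymmetry between proving a lack of identity and proving identity.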

EXAMPLE 39 

Positive Identification by DNA Sequencing

The technique outlined in the previous example may also be used on a larger scale to provide a 
unique fingerprint-type identification of any individual. In this technique, primers are prepared from a 
large number of EST-related nucleic acids, positional segments of EST-related nucleic acids or 
fragments of positional segments of EST-related nucleic acids. Preferably, 20 to 50 different primers are
used. These primers are used to obtain a corresponding number of PCR-generated DNA segments from
the individual in question in accordance with Example 34. Each of these DNA segments is sequenced,
using the methods set forth in Example 36. The database of sequences generated through this procedure
uniquely identifies the individual from whom the sequences were obtained. The same panel of primers
may then be used at any later time to absolutely correlate tissue or other biological specimen with that
individual.

EXAMPLE 40 
Southern Blot Forensic Identification 
The procedure of Example 38 is repeated to obtain a panel of at least 10 amplified sequences
from an individual and a specimen. Preferably, the panel contains at least 50 amplified sequences. More
preferably, the panel contains 100 amplified sequences. In some embodiments, the panel contains 200
amplified sequences. This PCR-generated DNA is then digested with one restriction enzyme or a
combination of restriction enzymes, preferably four-base-specific enzymes. Such enzymes are
commercially available and known to those of skill in the art. After digestion, the resultant gene
fragments are size separated in multiple duplicate wells on an agarose gel and transferred to
nitrocellulose using Southern blotting techniques well known to those with skill in the art. For a review
of Southern blotting see Davis et al. (Basic Methods in Molecular Biology, 1986, Elsevier Press, pp 62-65).
A panel of probes based on the sequences of the EST-related nucleic acids, positional segments
of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids is
radioactively or colorimetrically labeled using methods known in the art, such as nick translation or end
labeling, and hybridized to the Southern blot using techniques known in the art (Davis et al., supra).
Preferably, the probes are at least 10, 12, 15, 18, 20, 25, 28, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400 or
500 nucleotides in length. In some embodiments, the probes are oligonucleotides which are
40 nucleotides in length or less.

Preferably, at least 5 to 10 of these labeled probes are used, and more preferably at least about
20 or 30 are used to provide a unique pattern. The resultant bands appearing from the hybridization of a
large sample of EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments
of positional segments of EST-related nucleic acids will be a unique identifier. Since the restriction
enzyme cleavage will be different for every individual, the band pattern on the Southern blot will also be
unique. Increasing the number of probes will provide a statistically higher level of confidence in the
identification since there will be an increased number of sets of bands used for identification.
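
The way a sequence difference at a recognition site changes the band pattern can be illustrated with a short sketch. It is a simplification under stated assumptions: the function name `digest` is hypothetical, the cut is placed at the start of each recognition site (real enzymes cut at a defined position within or near the site), and the sequences in the usage note are invented for illustration.

```python
def digest(seq, site):
    """Return the sorted fragment lengths produced by cutting seq at each site.

    Simplification: the cut is modeled at the first base of every occurrence
    of the recognition site.
    """
    seq, site = seq.upper(), site.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        if pos > 0:
            cuts.append(pos)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    # Fragment lengths are the gaps between consecutive cut positions.
    return sorted(end - start for start, end in zip(bounds, bounds[1:]))
```

For example, `digest("AAAGATCCCCGATCTT", "GATC")` gives fragments of 3, 6 and 7 bases, while a variant in which the second site is lost, `digest("AAAGATCCCCGTTCTT", "GATC")`, gives fragments of 3 and 13 bases, so the two samples would show different bands.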

EXAMPLE 41 
Dot Blot Identification Procedure 
Another technique for identifying individuals using the EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids
disclosed herein utilizes a dot blot hybridization technique.

Genomic DNA is isolated from nuclei of the subject to be identified. Probes are prepared that
correspond to at least 10, preferably 50 sequences from the EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids.
The probes are used to hybridize to the genomic DNA through conditions known to those in the art. The
oligonucleotides are end labeled with 32P using polynucleotide kinase (Pharmacia). Dot blots are created
by spotting the genomic DNA onto nitrocellulose or the like using a vacuum dot blot manifold (BioRad,
Richmond, California). The genomic DNA is baked or UV cross-linked to the nitrocellulose filter, which
is then prehybridized and hybridized with labeled probe using techniques known in the art (Davis et
al., supra). The 32P-labeled DNA fragments are sequentially hybridized with successively stringent
conditions to detect minimal differences between the 30 bp sequence and the DNA.
Tetramethylammonium chloride is useful for identifying clones containing small numbers of nucleotide
mismatches (Wood et al., Proc. Natl. Acad. Sci. USA 82(6):1585-1588 (1985)). A unique pattern of dots
distinguishes one individual from another individual.

EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids can be used as probes in the following alternative
fingerprinting technique. In some embodiments, the probes are oligonucleotides which are 40
nucleotides in length or less.

Preferably, a plurality of probes having sequences from different EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids are used in the alternative fingerprinting technique. Example 42 below provides a
representative alternative fingerprinting procedure in which the probes are derived from EST-related
nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids.

EXAMPLE 42

Alternative "Fingerprint" Identification Technique 
Oligonucleotides are prepared from a large number, e.g. 50, 100, or 200, EST-related nucleic
acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids using commercially available oligonucleotide services such as Genset, Paris,
France. Preferably, the oligonucleotides are at least 10, 15, 18, 20, 23, 25, 28, or 30 nucleotides in length.
However, in some embodiments, the oligonucleotides may be more than 40, 50, 60 or 70 nucleotides in
length.

Cell samples from the test subject are processed for DNA using techniques well known to those
with skill in the art. The nucleic acid is digested with restriction enzymes such as EcoRI and XbaI.
Following digestion, samples are applied to wells for electrophoresis. The procedure, as known in the
art, may be modified to accommodate polyacrylamide electrophoresis; however, in this example, samples
containing 5 µg of DNA are loaded into wells and separated on 0.8% agarose gels. The DNA is
transferred from the gels onto nitrocellulose using standard Southern blotting techniques.

10 ng of each of the oligonucleotides are pooled and end-labeled with 32P. The nitrocellulose is
prehybridized with blocking solution and hybridized with the labeled probes. Following hybridization
and washing, the nitrocellulose filter is exposed to X-Omat AR X-ray film. The resulting hybridization
pattern will be unique for each individual.

It is additionally contemplated within this example that the number of probe sequences used can 
be varied for additional accuracy or clarity. 

In addition to their applications in forensics and identification, EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids may be mapped to their chromosomal locations. Example 43 below describes radiation
hybrid (RH) mapping of human chromosomal regions using EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids.
Example 44 below describes a representative procedure for mapping EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids to their locations on human chromosomes. Example 45 below describes mapping of
EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of positional
segments of EST-related nucleic acids on metaphase chromosomes by Fluorescence In Situ
Hybridization (FISH).

2. Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids in Chromosome Mapping

EXAMPLE 43 

Radiation hybrid mapping of EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids to the human genome

Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used for high
resolution mapping of the human genome. In this approach, cell lines containing one or more human
chromosomes are lethally irradiated, breaking each chromosome into fragments whose size depends on
the radiation dose. These fragments are rescued by fusion with cultured rodent cells, yielding subclones
containing different portions of the human genome. This technique is described by Benham et al.
(Genomics 4:509-517, 1989) and Cox et al. (Science 250:245-250, 1990). The random and independent
nature of the subclones permits efficient mapping of any human genome marker. Human DNA isolated
from a panel of 80-100 cell lines provides a mapping reagent for ordering EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids. In this approach, the frequency of breakage between markers is used to measure distance,
allowing construction of fine resolution maps as has been done using conventional ESTs (Schuler et al.,
Science 274:540-546, 1996).
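
As an illustration of how breakage frequency can be converted to distance, the following sketch applies one commonly used two-point estimator to the retention patterns of two markers across a hybrid panel. The estimator, the centiRay conversion, and the function names `breakage_frequency` and `centirays` are assumptions introduced for this sketch; they are not taken from the present text.

```python
import math


def breakage_frequency(a, b):
    """Estimate the breakage frequency (theta) between two markers.

    a, b: equal-length lists of 0/1 flags, one per hybrid cell line,
    where 1 means the marker was retained in that line.
    Assumed estimator: discordant lines / (T * (Ra + Rb - 2*Ra*Rb)).
    """
    if len(a) != len(b) or not a:
        raise ValueError("retention patterns must be non-empty and equal length")
    total = len(a)
    discordant = sum(1 for x, y in zip(a, b) if x != y)
    ra = sum(a) / total  # retention fraction of marker A
    rb = sum(b) / total  # retention fraction of marker B
    denom = total * (ra + rb - 2 * ra * rb)
    if denom == 0:
        raise ValueError("markers retained in all or none of the lines")
    return min(discordant / denom, 1.0)


def centirays(theta):
    """Convert a breakage frequency to a map distance in centiRays."""
    if theta >= 1.0:
        return math.inf  # markers behave as unlinked
    return -100.0 * math.log(1.0 - theta)
```

Two markers with identical retention patterns give a breakage frequency of zero and therefore zero distance; the more often one marker is retained without the other, the larger the estimated distance.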

RH mapping has been used to generate a high-resolution whole genome radiation hybrid map of
human chromosome 17q22-q25.3 across the genes for growth hormone (GH) and thymidine kinase (TK)
(Foster et al., Genomics 33:185-192, 1996), the region surrounding the Gorlin syndrome gene
(Obermayr et al., Eur. J. Hum. Genet. 4:242-245, 1996), 60 loci covering the entire short arm of
chromosome 12 (Raeymaekers et al., Genomics 29:170-178, 1995), the region of human chromosome
22 containing the neurofibromatosis type 2 locus (Frazer et al., Genomics 14:574-584, 1992) and 13 loci
on the long arm of chromosome 5 (Warrington et al., Genomics 11:701-708, 1991).


EXAMPLE 44 

Mapping of EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids to Human Chromosomes using PCR techniques

EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids may be assigned to human chromosomes using
PCR-based methodologies. In such approaches, oligonucleotide primer pairs are designed from EST-related
nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids to minimize the chance of amplifying through an intron. Preferably, the
oligonucleotide primers are 18-23 bp in length and are designed for PCR amplification. The creation of
PCR primers from known sequences is well known to those with skill in the art. For a review of PCR
technology see Erlich, in PCR Technology: Principles and Applications for DNA Amplification, 1992,
W.H. Freeman and Co., New York.

The primers are used in polymerase chain reactions (PCR) to amplify templates from total
human genomic DNA. PCR conditions are as follows: 60 ng of genomic DNA is used as a template for
PCR with 80 ng of each oligonucleotide primer, 0.6 unit of Taq polymerase, and 1 µCi of a 32P-labeled
deoxycytidine triphosphate. The PCR is performed in a microplate thermocycler (Techne) under the
following conditions: 30 cycles of 94°C, 1.4 min; 55°C, 2 min; and 72°C, 2 min; with a final extension
at 72°C for 10 min. The amplified products are analyzed on a 6% polyacrylamide sequencing gel and
visualized by autoradiography. If the length of the resulting PCR product is identical to the distance
between the ends of the primer sequences in the 5'EST from which the primers are derived, then the
PCR reaction is repeated with DNA templates from two panels of human-rodent somatic cell hybrids,
BIOS PCRable DNA (BIOS Corporation) and NIGMS Human-Rodent Somatic Cell Hybrid Mapping
Panel Number 1 (NIGMS, Camden, NJ).

PCR is used to screen a series of somatic cell hybrid cell lines containing defined sets of human
chromosomes for the presence of a given 5'EST. DNA is isolated from the somatic hybrids and used as
starting templates for PCR reactions using the primer pairs from the EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids.
Only those somatic cell hybrids with chromosomes containing the human gene corresponding to the
EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of positional
segments of EST-related nucleic acids will yield an amplified fragment. The 5'ESTs are assigned to a
chromosome by analysis of the segregation pattern of PCR products from the somatic hybrid DNA
templates. The single human chromosome present in all cell hybrids that give rise to an amplified
fragment is the chromosome containing that EST-related nucleic acid, positional segment of an
EST-related nucleic acid or fragment of a positional segment of an EST-related nucleic acid. For a review
of techniques and analysis of results from somatic cell gene mapping experiments, see Ledbetter et al.,
Genomics 6:475-481 (1990).

Alternatively, the EST-related nucleic acids, positional segments of EST-related nucleic acids or 
fragments of positional segments of EST-related nucleic acids may be mapped to individual
chromosomes using FISH as described in Example 45 below. 






EXAMPLE 45 

Mapping of EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids to Chromosomes Using
Fluorescence In Situ Hybridization

Fluorescence in situ hybridization allows the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids to be mapped
to a particular location on a given chromosome. The chromosomes to be used for fluorescence in situ
hybridization techniques may be obtained from a variety of sources including cell cultures, tissues, or
whole blood.

In a preferred embodiment, chromosomal localization of EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids
is obtained by FISH as described by Cherif et al. (Proc. Natl. Acad. Sci. USA, 87:6639-6643, 1990).
Metaphase chromosomes are prepared from phytohemagglutinin (PHA)-stimulated blood cell donors.
PHA-stimulated lymphocytes from healthy males are cultured for 72 h in RPMI-1640 medium. For
synchronization, methotrexate (10 µM) is added for 17 h, followed by addition of 5-bromodeoxyuridine
(5-BrdU, 0.1 mM) for 6 h. Colcemid (1 µg/ml) is added for the last 15 min before harvesting the cells.
Cells are collected, washed in RPMI, incubated with a hypotonic solution of KCl (75 mM) at 37°C for
15 min and fixed in three changes of methanol:acetic acid (3:1). The cell suspension is dropped onto a
glass slide and air dried. The EST-related nucleic acids, positional segments of EST-related nucleic
acids or fragments of positional segments of EST-related nucleic acids is labeled with biotin-16 dUTP by
nick translation according to the manufacturer's instructions (Bethesda Research Laboratories, Bethesda,
MD), purified using a Sephadex G-50 column (Pharmacia, Uppsala, Sweden) and precipitated. Just prior
to hybridization, the DNA pellet is dissolved in hybridization buffer (50% formamide, 2 X SSC, 10%
dextran sulfate, 1 mg/ml sonicated salmon sperm DNA, pH 7) and the probe is denatured at 70°C for
5-10 min.

Slides kept at -20°C are treated for 1 h at 37°C with RNase A (100 µg/ml), rinsed three times in
2 X SSC and dehydrated in an ethanol series. Chromosome preparations are denatured in 70%
formamide, 2 X SSC for 2 min at 70°C, then dehydrated at 4°C. The slides are treated with proteinase K
(10 µg/100 ml in 20 mM Tris-HCl, 2 mM CaCl2) at 37°C for 8 min and dehydrated. The hybridization
mixture containing the probe is placed on the slide, covered with a coverslip, sealed with rubber cement
and incubated overnight in a humid chamber at 37°C. After hybridization and post-hybridization
washes, the biotinylated probe is detected by avidin-FITC and amplified with additional layers of
biotinylated goat anti-avidin and avidin-FITC. For chromosomal localization, fluorescent R-bands are
obtained as previously described (Cherif et al., supra). The slides are observed under a LEICA
fluorescence microscope (DMRXA). Chromosomes are counterstained with propidium iodide and the
fluorescent signal of the probe appears as two symmetrical yellow-green spots on both chromatids of the
fluorescent R-band chromosome (red). Thus, a particular EST-related nucleic acid, positional segment
of an EST-related nucleic acid or fragment of a positional segment of an EST-related nucleic acid may be
localized to a particular cytogenetic R-band on a given chromosome.
Once the EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids have been assigned to particular
chromosomes using the techniques described in Examples 42-44 above, they may be utilized to construct
a high resolution map of the chromosomes on which they are located or to identify the chromosomes in a
sample.



EXAMPLE 46 

Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of 
positional segments of EST-related nucleic acids to Construct or Expand Chromosome Maps 

Chromosome mapping involves assigning a given unique sequence to a particular chromosome
as described above. Once the unique sequence has been mapped to a given chromosome, it is ordered
relative to other unique sequences located on the same chromosome. One approach to chromosome
mapping utilizes a series of yeast artificial chromosomes (YACs) bearing several thousand long inserts
derived from the chromosomes of the organism from which the EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids
are obtained. This approach is described in Ramaiah Nagaraja et al., Genome Research 7:210-222,
March 1997. Briefly, in this approach each chromosome is broken into overlapping pieces which are
inserted into the YAC vector. The YAC inserts are screened using PCR or other methods to determine
whether they include the EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids whose position is to be determined. Once
an insert has been found which includes the 5'EST, the insert can be analyzed by PCR or other methods
to determine whether the insert also contains other sequences known to be on the chromosome or in the
region from which the EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids was derived. This process can be
repeated for each insert in the YAC library to determine the location of each of the EST-related nucleic
acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids relative to one another and to other known chromosomal markers. In this way, a
high resolution map of the distribution of numerous unique markers along each of the organism's
chromosomes may be obtained.
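
The screening results from such a YAC library lend themselves to a simple tabulation of which markers are found together on the same insert. The sketch below is illustrative only: the function name `shared_inserts`, the YAC labels, and the marker labels are hypothetical, and counting shared inserts is just one elementary way to summarize marker proximity before ordering.

```python
from itertools import combinations


def shared_inserts(yac_content):
    """Count, for each pair of markers, the YAC inserts that contain both.

    yac_content: dict mapping a YAC label to the set of markers detected
    on that insert (for example by PCR screening).
    Returns a dict keyed by sorted marker pairs.
    """
    counts = {}
    for markers in yac_content.values():
        for pair in combinations(sorted(markers), 2):
            counts[pair] = counts.get(pair, 0) + 1
    return counts
```

For a library in which markers A and B share two inserts, B and C share one, and A and C share none, the counts suggest the order A, B, C, since overlapping inserts link only neighboring markers.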

As described in Example 47 below, EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids may also be used
to identify genes associated with a particular phenotype, such as hereditary disease or drug response.



3. Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids in Gene Identification

EXAMPLE 47 

Identification of genes associated with hereditary diseases or drug response 
This example illustrates an approach useful for the association of EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids with particular phenotypic characteristics. In this example, a particular EST-related nucleic
acid, positional segment of an EST-related nucleic acid or fragment of a positional segment of an
EST-related nucleic acid is used as a test probe to associate that EST-related nucleic acid, positional
segment of an EST-related nucleic acid or fragment of a positional segment of an EST-related nucleic acid
with a particular phenotypic characteristic.

EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids are mapped to a particular location on a human
chromosome using techniques such as those described in Examples 41 and 42 or other techniques known
in the art. A search of Mendelian Inheritance in Man (V. McKusick, Mendelian Inheritance in Man,
available on line through Johns Hopkins University Welch Medical Library) reveals the region of the
human chromosome which contains the EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids to be a very gene rich
region containing several known genes and several diseases or phenotypes for which genes have not
been identified. The gene corresponding to this EST-related nucleic acid, positional segment of an
EST-related nucleic acid or fragment of a positional segment of an EST-related nucleic acid thus becomes an
immediate candidate for each of these genetic diseases.

Cells from patients with these diseases or phenotypes are isolated and expanded in culture. PCR
primers from the EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids are used to screen genomic DNA, mRNA
or cDNA obtained from the patients. EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids that are not amplified in
the patients can be positively associated with a particular disease by further analysis. Alternatively, the
PCR analysis may yield fragments of different lengths when the samples are derived from an individual
having the phenotype associated with the disease than when the sample is derived from a healthy
individual, indicating that the gene containing the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids may be
responsible for the genetic disease.


VII. Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or 
fragments of positional segments of EST-related nucleic acids to Construct Vectors 

The present EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids may also be used to construct secretion
vectors capable of directing the secretion of the proteins encoded by genes therein. Such secretion
vectors may facilitate the purification or enrichment of the proteins encoded by genes inserted therein by
reducing the number of background proteins from which the desired protein must be purified or
enriched. Exemplary secretion vectors are described in Example 48 below.
1. Construction of secretion vectors 

EXAMPLE 48 

Construction of Secretion Vectors

The secretion vectors of the present invention include a promoter capable of directing gene 
expression in the host cell, tissue, or organism of interest. Such promoters include the Rous Sarcoma 
Virus promoter, the SV40 promoter, the human cytomegalovirus promoter, and other promoters familiar
to those skilled in the art. 

A signal sequence from one of the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids is operably linked
to the promoter such that the mRNA transcribed from the promoter will direct the translation of the
signal peptide. Preferably, the signal sequence is from one of the nucleic acids of SEQ ID NOs. 24-811.
The host cell, tissue, or organism may be any cell, tissue, or organism which recognizes the signal
peptide encoded by the signal sequence in the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids. Suitable hosts
include mammalian cells, tissues or organisms, avian cells, tissues, or organisms, insect cells, tissues or
organisms, or yeast.

In addition, the secretion vector contains cloning sites for inserting genes encoding the proteins 
which are to be secreted. The cloning sites facilitate the cloning of the insert gene in frame with the
signal sequence such that a fusion protein in which the signal peptide is fused to the protein encoded by 
the inserted gene is expressed from the mRNA transcribed from the promoter. The signal peptide directs 
the extracellular secretion of the fusion protein. 
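
The in-frame requirement can be checked mechanically before a construct is made. The sketch below is a minimal illustration under stated assumptions: the function name `fuse_in_frame` is hypothetical, the "linker" stands for whatever cloning-site nucleotides lie between the signal sequence and the insert, and the sequences in the usage note are invented for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(signal_cds, linker, insert_cds):
    """Join signal sequence, cloning-site linker and insert coding sequence.

    Raises ValueError if the insert would not be read in the same frame as
    the signal sequence, or if a stop codon precedes the insert.
    """
    signal_cds, linker, insert_cds = (
        s.upper() for s in (signal_cds, linker, insert_cds)
    )
    if not signal_cds.startswith("ATG"):
        raise ValueError("signal sequence should begin with a start codon")
    upstream = signal_cds + linker
    if len(upstream) % 3 != 0:
        raise ValueError("insert would be out of frame with the signal sequence")
    # Read the upstream region codon by codon from the start codon.
    codons = [upstream[i:i + 3] for i in range(0, len(upstream), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("stop codon upstream of the insert")
    return upstream + insert_cds
```

For instance, `fuse_in_frame("ATGAAA", "GGATCC", "GCTTAA")` returns the joined coding sequence, whereas a two-nucleotide linker would shift the reading frame and is rejected.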

The secretion vector may be DNA or RNA and may integrate into the chromosome of the host, 
be stably maintained as an extrachromosomal replicon in the host, be an artificial chromosome, or be
transiently present in the host. Preferably, the secretion vector is maintained in multiple copies in each 
host cell. As used herein, multiple copies means at least 2, 5, 10, 20, 25, 50 or more than 50 copies per 
cell. In some embodiments, the multiple copies are maintained extrachromosomally. In other 
embodiments, the multiple copies result from amplification of a chromosomal sequence. 
Many nucleic acid backbones suitable for use as secretion vectors are known to those skilled in
the art, including retroviral vectors, SV40 vectors, Bovine Papilloma Virus vectors, yeast integrating
plasmids, yeast episomal plasmids, yeast artificial chromosomes, human artificial chromosomes, P
element vectors, baculovirus vectors, or bacterial plasmids capable of being transiently introduced into
the host.

The secretion vector may also contain a polyA signal such that the polyA signal is located
downstream of the gene inserted into the secretion vector.
After the gene encoding the protein for which secretion is desired is inserted into the secretion
vector, the secretion vector is introduced into the host cell, tissue, or organism using calcium phosphate
precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection, viral particles or as
naked DNA. The protein encoded by the inserted gene is then purified or enriched from the supernatant
using conventional techniques such as ammonium sulfate precipitation, immunoprecipitation,
immunoaffinity chromatography, size exclusion chromatography, ion exchange chromatography, and
HPLC. Alternatively, the secreted protein may be in a sufficiently enriched or pure state in the
supernatant or growth media of the host to permit it to be used for its intended purpose without further
enrichment.

The signal sequences may also be inserted into vectors designed for gene therapy. In such
vectors, the signal sequence is operably linked to a promoter such that mRNA transcribed from the
promoter encodes the signal peptide. A cloning site is located downstream of the signal sequence such
that a gene encoding a protein whose secretion is desired may readily be inserted into the vector and
fused to the signal sequence. The vector is introduced into an appropriate host cell. The protein
expressed from the promoter is secreted extracellularly, thereby producing a therapeutic effect.

EXAMPLE 49 

Fusion Vectors 

The EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids may be used to construct fusion vectors for the
expression of chimeric polypeptides. The chimeric polypeptides comprise a first polypeptide portion
and a second polypeptide portion. In the fusion vectors of the present invention, nucleic acids encoding
the first polypeptide portion and the second polypeptide portion are joined in frame with one another so
as to generate a nucleic acid encoding the chimeric polypeptide. The nucleic acid encoding the chimeric
polypeptide is operably linked to a promoter which directs the expression of an mRNA encoding the
chimeric polypeptide. The promoter may be in any of the expression vectors described herein including
those described in Examples 21 and 48.

Preferably, the fusion vector is maintained in multiple copies in each host cell. In some 
embodiments, the multiple copies are maintained extrachromosomally. In other embodiments, the 
multiple copies result from amplification of a chromosomal sequence.

The first polypeptide portion may comprise any of the polypeptides encoded by the EST-related 
nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of 
EST-related nucleic acids. In some embodiments, the first polypeptide portion may be one of the
EST-related polypeptides, fragments of EST-related polypeptides, positional segments of EST-related
polypeptides, or fragments of positional segments of EST-related polypeptides.

The second polypeptide portion may comprise any polypeptide of interest. In some
embodiments, the second polypeptide portion may comprise a polypeptide having a detectable 




enzymatic activity such as green fluorescent protein or β-galactosidase. Chimeric polypeptides in which
the second polypeptide portion comprises a detectable polypeptide may be used to determine the
intracellular localization of the first polypeptide portion. In such procedures, the fusion vector encoding
the chimeric polypeptide is introduced into a host cell under conditions which facilitate the expression of
the chimeric polypeptide. Where appropriate, the cells are treated with a detection reagent which is
visible under the microscope following a catalytic reaction with the detectable polypeptide and the
cellular location of the detection reagent is determined. For example, if the polypeptide having a
detectable enzymatic activity is β-galactosidase, the cells may be treated with X-gal. Alternatively, where
the detectable polypeptide is directly detectable without the addition of a detection reagent, the
intracellular location of the chimeric polypeptide is determined by performing microscopy under
conditions in which the detectable polypeptide is visible. For example, if the detectable polypeptide is
green fluorescent protein or a modified version thereof, microscopy is performed by exposing the host 
cells to light having an appropriate wavelength to cause the green fluorescent protein or modified version 
thereof to fluoresce. 

Alternatively, the second polypeptide portion may comprise a polypeptide whose isolation,
purification, or enrichment is desired. In such embodiments, the isolation, purification, or enrichment of
the second polypeptide portion may be achieved by performing the immunoaffinity chromatography
procedures described below using an immunoaffinity column having an antibody directed against the
first polypeptide portion coupled thereto.

The proteins encoded by the EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids or the EST-related
polypeptides, fragments of EST-related polypeptides, positional segments of EST-related polypeptides,
or fragments of positional segments of EST-related polypeptides may also be used to generate antibodies
as explained herein in order to identify the tissue type or cell species from which a sample is derived as
described in Example 50.

EXAMPLE 50 

Identification of Tissue Types or Cell Species by Means of
Labeled Tissue Specific Antibodies 

Identification of specific tissues is accomplished by the visualization of tissue specific antigens
by means of antibody preparations as described herein which are conjugated, directly or indirectly, to a
detectable marker. Selected labeled antibody species bind to their specific antigen binding partner in
tissue sections, cell suspensions, or in extracts of soluble proteins from a tissue sample to provide a
pattern for qualitative or semi-qualitative interpretation.

Antisera for these procedures must have a potency exceeding that of the native preparation, and
for that reason, antibodies are concentrated to a mg/ml level by isolation of the gamma globulin fraction,
for example, by ion-exchange chromatography or by ammonium sulfate fractionation. Also, to provide
the most specific antisera, unwanted antibodies, for example to common proteins, must be removed from
the gamma globulin fraction, for example by means of insoluble immunoabsorbents, before the
antibodies are labeled with the marker. Either monoclonal or heterologous antisera are suitable for either
procedure.

1. Immunohistochemical Techniques

Purified, high-titer antibodies, prepared as described above, are conjugated to a detectable
marker, as described, for example, by Fudenberg, H., Chap. 26 in: Basic & Clinical Immunology, 3rd
Ed. Lange, Los Altos, California (1980) or Rose et al., Chap. 12 in: Methods in Immunodiagnosis, 2d
Ed. John Wiley and Sons, New York (1980).

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies can also be
labeled with an enzyme that supports a color producing reaction with a substrate, such as horseradish
peroxidase. Markers can be added to tissue-bound antibody in a second step, as described below.
Alternatively, the specific antitissue antibodies can be labeled with ferritin or other electron dense
particles, and localization of the ferritin coupled antigen-antibody complexes achieved by means of an
electron microscope. In yet another approach, the antibodies are radiolabeled with, for example, ¹²⁵I, and
detected by overlaying the antibody treated preparation with photographic emulsion.

Preparations to carry out the procedures can comprise monoclonal or polyclonal antibodies to a
single protein or peptide identified as specific to a tissue type, for example, brain tissue, or antibody
preparations to several antigenically distinct tissue specific antigens can be used in panels, independently
or in mixtures, as required.

Tissue sections and cell suspensions are prepared for immunohistochemical examination
according to common histological techniques. Multiple cryostat sections (about 4 µm, unfixed) of the
unknown tissue and known control are mounted and each slide covered with different dilutions of the
antibody preparation. Sections of known and unknown tissues should also be treated with preparations
to provide a positive control, a negative control, for example, pre-immune sera, and a control for
non-specific staining, for example, buffer.

Treated sections are incubated in a humid chamber for 30 min at room temperature, rinsed, then 
washed in buffer for 30-45 min. Excess fluid is blotted away, and the marker developed. 

If the tissue specific antibody was not labeled in the first incubation, it can be labeled at this time
in a second antibody-antibody reaction, for example, by adding fluorescein- or enzyme-conjugated
antibody against the immunoglobulin class of the antiserum-producing species, for example, fluorescein
labeled antibody to mouse IgG. Such labeled sera are commercially available.

The antigen found in the tissues by the above procedure can be quantified by measuring the 
intensity of color or fluorescence on the tissue section, and calibrating that signal using appropriate 

standards.
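
The calibration step can be sketched as follows. This is a minimal illustration that assumes the signal is approximately linear in the amount of antigen over the range of the standards; the function names and the numerical values are hypothetical and are not taken from the specification.

```python
def fit_line(amounts, signals):
    """Least-squares line through the standards: signal = slope * amount + intercept."""
    n = len(amounts)
    mean_x = sum(amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in amounts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(amounts, signals))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def amount_from_signal(signal, slope, intercept):
    """Invert the calibration line to estimate the amount of antigen."""
    return (signal - intercept) / slope

# Hypothetical standards: known antigen amounts and the measured intensity of each.
slope, intercept = fit_line([0.0, 1.0, 2.0, 4.0], [10.0, 30.0, 50.0, 90.0])
print(amount_from_signal(70.0, slope, intercept))
```

Signals that fall outside the range covered by the standards should not be extrapolated with a line of this kind.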

2. Identification of Tissue Specific Soluble Proteins

The visualization of tissue specific proteins and identification of unknown tissues from that
procedure is carried out using the labeled antibody reagents and detection strategy as described for
immunohistochemistry; however the sample is prepared according to an electrophoretic technique to
distribute the proteins extracted from the tissue in an orderly array on the basis of molecular weight for
detection.

A tissue sample is homogenized using a Virtis apparatus; cell suspensions are disrupted by
Dounce homogenization or osmotic lysis, using detergents in either case as required to disrupt cell
membranes, as is the practice in the art. Insoluble cell components such as nuclei, microsomes, and
membrane fragments are removed by ultracentrifugation, and the soluble protein-containing fraction
concentrated if necessary and reserved for analysis.

A sample of the soluble protein solution is resolved into individual protein species by
conventional SDS polyacrylamide electrophoresis as described, for example, by Davis, L. et al., Section
19-2 in: Basic Methods in Molecular Biology (P. Leder, ed), Elsevier, New York (1986), using a range
of amounts of polyacrylamide in a set of gels to resolve the entire molecular weight range of proteins to
be detected in the sample. A size marker is run in parallel for purposes of estimating molecular weights
of the constituent proteins. Sample size for analysis is a convenient volume of from 5 to 55 µl, and
containing from about 1 to 100 µg protein. An aliquot of each of the resolved proteins is transferred by
blotting to a nitrocellulose filter paper, a process that maintains the pattern of resolution. Multiple copies
are prepared. The procedure, known as Western Blot Analysis, is well described in Davis, L. et al.,
supra, Section 19-3. One set of nitrocellulose blots is stained with Coomassie Blue dye to visualize the
entire set of proteins for comparison with the antibody bound proteins. The remaining nitrocellulose
filters are then incubated with a solution of one or more specific antisera to tissue specific proteins
prepared as described in Examples 20 and 33. In this procedure, as in procedure 1 above, appropriate
positive and negative sample and reagent controls are run.
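
The use of the size marker to estimate molecular weights may be illustrated by a short sketch. It assumes the common approximation that the logarithm of molecular weight varies linearly with relative migration between neighboring marker bands; the function name and the marker values are hypothetical.

```python
import math

def estimate_mw(relative_migration, markers):
    """Interpolate log10(MW) between the two marker bands flanking the unknown band.

    markers: (relative migration, molecular weight in kDa) pairs with distinct migrations.
    """
    points = sorted(markers)
    for (r1, m1), (r2, m2) in zip(points, points[1:]):
        if r1 <= relative_migration <= r2:
            t = (relative_migration - r1) / (r2 - r1)
            log_mw = math.log10(m1) + t * (math.log10(m2) - math.log10(m1))
            return 10 ** log_mw
    raise ValueError("band lies outside the range covered by the size marker")

# Hypothetical marker bands at relative migrations 0.2 (100 kDa) and 0.6 (10 kDa).
print(estimate_mw(0.4, [(0.2, 100.0), (0.6, 10.0)]))
```

The estimate is only as good as the marker coverage, which is one reason the text runs a set of gels with different polyacrylamide concentrations.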

In either procedure described above a detectable label can be attached to the primary tissue
antigen-primary antibody complex according to various strategies and permutations thereof. In a
straightforward approach, the primary specific antibody can be labeled; alternatively, the unlabeled
complex can be bound by a labeled secondary anti-IgG antibody. In other approaches, either the primary
or secondary antibody is conjugated to a biotin molecule, which can, in a subsequent step, bind an avidin
conjugated marker. According to yet another strategy, enzyme labeled or radioactive protein A, which
has the property of binding to any IgG, is bound in a final step to either the primary or secondary
antibody.

EXAMPLE 51

Immunohistochemical Localization of Polypeptides

The antibodies prepared as described herein above may be utilized to determine the cellular
location of a polypeptide. The polypeptide may be any of the polypeptides encoded by EST-related
nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids or the polypeptide may be one of the EST-related polypeptides, fragments of
EST-related polypeptides, positional segments of EST-related polypeptides, or fragments of positional
segments of EST-related polypeptides. In some embodiments, the polypeptide may be a chimeric
polypeptide such as those encoded by the fusion vectors of Example 49.

Cells expressing the polypeptide to be localized are applied to a microscope slide and fixed
using any of the procedures typically employed in immunohistochemical localization techniques,
including the methods described in Current Protocols in Molecular Biology, John Wiley and Sons, Inc.
1997. Following a washing step, the cells are contacted with the antibody. In some embodiments, the
antibody is conjugated to a detectable marker as described above to facilitate detection. Alternatively, in
some embodiments, after the cells have been contacted with an antibody to the polypeptide to be
localized, a secondary antibody which has been conjugated to a detectable marker is placed in contact
with the antibody against the polypeptide to be localized.

Thereafter, microscopy is performed under conditions suitable for visualizing the cellular 

location of the polypeptide.

The visualization of tissue specific antigen binding at levels above those seen in control tissues
to one or more tissue specific antibodies, directed against the polypeptides encoded by EST-related
nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids or antibodies against the EST-related polypeptides, fragments of EST-related
polypeptides, positional segments of EST-related polypeptides, or fragments of positional segments of
EST-related polypeptides, can identify tissues of unknown origin, for example, forensic samples, or
differentiated tumor tissue that has metastasized to foreign bodily sites.

The antibodies described herein may also be used in the immunoaffinity chromatography 
techniques described below to isolate, purify or enrich the polypeptides encoded by the EST-related 

nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids or to isolate, purify or enrich EST-related polypeptides, fragments of EST- 
related polypeptides, positional segments of EST-related polypeptides, or fragments of positional 
segments of EST-related polypeptides. The immunoaffinity chromatography techniques described 
below may also be used to isolate, purify or enrich polypeptides which have been linked to the 

polypeptides encoded by the EST-related nucleic acids, positional segments of EST-related nucleic acids
or fragments of positional segments of EST-related nucleic acids or to isolate, purify or enrich 
polypeptides which have been linked to EST-related polypeptides, fragments of EST-related 
polypeptides, positional segments of EST-related polypeptides, or fragments of positional segments of 
EST-related polypeptides.

EXAMPLE 52

Immunoaffinity Chromatography
Antibodies prepared as described above are coupled to a support. Preferably, the antibodies are 
monoclonal antibodies, but polyclonal antibodies may also be used. The support may be any of those 
typically employed in immunoaffinity chromatography, including Sepharose CL-4B (Pharmacia,
Piscataway, NJ), Sepharose CL-2B (Pharmacia, Piscataway, NJ), Affi-gel 10 (Biorad, Richmond, CA), 
or glass beads. 

The antibodies may be coupled to the support using any of the coupling reagents typically used 
in immunoaffinity chromatography, including cyanogen bromide. After coupling the antibody to the 

support, the support is contacted with a sample which contains a target polypeptide whose isolation,
purification or enrichment is desired. The target polypeptide may be a polypeptide encoded by the EST- 
related nucleic acids, positional segments of EST-related nucleic acids or fragments of positional 
segments of EST-related nucleic acids or the target polypeptide may be one of the EST-related 
polypeptides, fragments of EST-related polypeptides, positional segments of EST-related polypeptides, 

or fragments of positional segments of EST-related polypeptides. The target polypeptides may also be
polypeptides which have been linked to the polypeptides encoded by the EST-related nucleic acids, 
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related 
nucleic acids or the target polypeptides may be polypeptides which have been linked to EST-related 
polypeptides, fragments of EST-related polypeptides, positional segments of EST-related polypeptides, 

or fragments of positional segments of EST-related polypeptides using the fusion vectors described
above. 

Preferably, the sample is placed in contact with the support for a sufficient amount of time and 
under appropriate conditions to allow at least 50% of the target polypeptide to specifically bind to the 
antibody coupled to the support. 

Thereafter, the support is washed with an appropriate wash solution to remove polypeptides
which have non-specifically adhered to the support. The wash solution may be any of those typically
employed in immunoaffinity chromatography, including PBS, Tris-lithium chloride buffer (0.1M lysine
base and 0.5M lithium chloride, pH 8.0), Tris-hydrochloride buffer (0.05M Tris-hydrochloride, pH 8.0),
or Tris/Triton/NaCl buffer (50mM Tris-Cl, pH 8.0 or 9.0, 0.1% Triton X-100, and 0.5M NaCl).
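
As a worked illustration of the Tris/Triton/NaCl buffer composition, the sketch below converts the stated concentrations into amounts for one liter. The helper names are hypothetical, the formula weights used (58.44 g/mol for NaCl and 121.14 g/mol for Tris base) are standard values rather than values given in the text, and the Triton X-100 percentage is assumed here to be volume per volume.

```python
def grams_for_molarity(molar, litres, grams_per_mole):
    """Mass of solute needed to reach the given molarity in the given volume."""
    return molar * litres * grams_per_mole

def millilitres_for_percent(percent_v_v, litres):
    """Volume of a liquid component for a volume/volume percentage."""
    return percent_v_v / 100.0 * litres * 1000.0

VOLUME_L = 1.0
print("NaCl (g):", round(grams_for_molarity(0.5, VOLUME_L, 58.44), 2))
print("Tris base (g):", round(grams_for_molarity(0.05, VOLUME_L, 121.14), 3))
print("Triton X-100 (ml):", round(millilitres_for_percent(0.1, VOLUME_L), 2))
```

The pH is adjusted separately (to 8.0 or 9.0 in the text) and is not modeled here.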

After washing, the specifically bound target polypeptide is eluted from the support using the
high pH or low pH elution solutions typically employed in immunoaffinity chromatography. In
particular, the elution solutions may contain an eluant such as triethanolamine, diethylamine, calcium
chloride, sodium thiocyanate, potassium bromide, acetic acid, or glycine. In some embodiments, the
elution solution may also contain a detergent such as Triton X-100 or octyl-β-D-glucoside.

The EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids may also be used to clone sequences located upstream
of the 5' ESTs which are capable of regulating gene expression, including promoter sequences, enhancer
sequences, and other upstream sequences which influence transcription or translation levels. Once
identified and cloned, these upstream regulatory sequences may be used in expression vectors designed 
to direct the expression of an inserted gene in a desired spatial, temporal, developmental, or quantitative 
fashion. Example 53 describes a method for cloning sequences upstream of the EST-related nucleic
acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids.

2. Identification of upstream sequences with promoting or regulatory activities 

EXAMPLE 53 

Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids to Clone Upstream Sequences from Genomic DNA
Sequences derived from EST-related nucleic acids, positional segments of EST-related nucleic 
acids or fragments of positional segments of EST-related nucleic acids may be used to isolate the 
promoters of the corresponding genes using chromosome walking techniques. In one chromosome 

walking technique, which utilizes the GenomeWalker™ kit available from Clontech, five complete
genomic DNA samples are each digested with a different restriction enzyme which has a 6 base
recognition site and leaves a blunt end. Following digestion, oligonucleotide adapters are ligated to each
end of the resulting genomic DNA fragments.
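
As a rough illustration of the fragment sizes to be expected from an enzyme with a 6 base recognition site, the sketch below estimates the average spacing between sites. It assumes, purely for illustration, that the four bases are equally frequent and independent, which real human genomic DNA is not; the function name is hypothetical.

```python
def expected_site_spacing(site_length, base_probability=0.25):
    """Average distance in bp between occurrences of one specific site of the given length."""
    if site_length <= 0:
        raise ValueError("site_length must be positive")
    # A specific site occurs at a given position with probability p**n,
    # so the mean spacing between occurrences is 1 / p**n.
    return 1.0 / (base_probability ** site_length)

print(expected_site_spacing(6))  # 4096.0
```

Under this simplified model each digest yields fragments averaging about 4 kb, and using five different enzymes makes it more likely that at least one library places a site at a useful distance upstream of the gene specific primer.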

For each of the five genomic DNA libraries, a first PCR reaction is performed according to the
manufacturer's instructions using an outer adapter primer provided in the kit and an outer gene specific
primer. The gene specific primer should be selected to be specific for the 5' EST of interest and should have
a melting temperature, length, and location in the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids which is
consistent with its use in PCR reactions. Each first PCR reaction contains 5 ng of genomic DNA, 5 µl of
10X Tth reaction buffer, 0.2 mM of each dNTP, 0.2 µM each of outer adapter primer and outer gene
specific primer, 1.1 mM of Mg(OAc)2, and 1 µl of the Tth polymerase 50X mix in a total volume of 50
µl. The reaction cycle for the first PCR reaction is as follows: 1 min at 94°C / 2 sec at 94°C, 3 min at
72°C (7 cycles) / 2 sec at 94°C, 3 min at 67°C (32 cycles) / 5 min at 67°C.
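
The cycling program for the first PCR reaction can be written down as data, which makes it easy to check the total programmed time. The sketch below is illustrative only: the data layout and names are hypothetical, the temperatures and times are those read from the paragraph above, and ramping time between temperatures is ignored.

```python
# Each stage is (number of cycles, [(temperature in deg C, hold time in seconds), ...]).
FIRST_PCR_PROGRAM = [
    (1, [(94, 60)]),                # initial denaturation
    (7, [(94, 2), (72, 180)]),      # 7 cycles annealing/extending at 72 deg C
    (32, [(94, 2), (67, 180)]),     # 32 cycles annealing/extending at 67 deg C
    (1, [(67, 300)]),               # final extension
]

def total_hold_seconds(program):
    """Sum of all programmed hold times, ignoring ramping between temperatures."""
    return sum(
        cycles * sum(seconds for _temperature, seconds in steps)
        for cycles, steps in program
    )

print(total_hold_seconds(FIRST_PCR_PROGRAM))  # 7458
```

The programmed holds therefore add up to a little over two hours before ramp times are counted.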

The product of the first PCR reaction is diluted and used as a template for a second PCR
reaction according to the manufacturer's instructions using a pair of nested primers which are located
internally on the amplicon resulting from the first PCR reaction. For example, 5 µl of the reaction
product of the first PCR reaction mixture may be diluted 180 times. Reactions are made in a 50 µl
volume having a composition identical to that of the first PCR reaction except the nested primers are
used. The first nested primer is specific for the adapter, and is provided with the GenomeWalker™
kit. The second nested primer is specific for the particular EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids
for which the promoter is to be cloned and should have a melting temperature, length, and location in



the EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids which is consistent with its use in PCR reactions.
The reaction parameters of the second PCR reaction are as follows: 1 min at 94°C / 2 sec at 94°C, 3
min at 72°C (6 cycles) / 2 sec at 94°C, 3 min at 67°C (25 cycles) / 5 min at 67°C. The product of the
second PCR reaction is purified, cloned, and sequenced using standard techniques.
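
The text asks that the gene specific primers have a melting temperature and length suitable for PCR but does not give a formula. As a hedged illustration, the sketch below computes GC content and a melting temperature estimate by the Wallace rule (2°C per A or T, 4°C per G or C). This rule is a rough rule of thumb intended for short oligonucleotides; nearest-neighbor methods are generally preferred for primers of the length used in such protocols. The function names and the test sequences are hypothetical.

```python
def _clean(primer):
    """Upper-case the primer and drop the spaces used to group bases in triplets."""
    return primer.upper().replace(" ", "")

def gc_fraction(primer):
    """Fraction of G and C bases in the primer."""
    seq = _clean(primer)
    return (seq.count("G") + seq.count("C")) / len(seq)

def wallace_tm(primer):
    """Wallace rule estimate in deg C: 2 per A/T plus 4 per G/C."""
    seq = _clean(primer)
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc

print(gc_fraction("ATG CAT GC"), wallace_tm("ATG CAT GC"))
```

Comparing the two primers of a pair in this way gives a quick check that their estimated melting temperatures are similar before a more accurate calculation is made.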

Alternatively, two or more human genomic DNA libraries can be constructed by using two or 
more restriction enzymes. The digested genomic DNA is cloned into vectors which can be converted 
into single stranded, circular, or linear DNA. A biotinylated oligonucleotide comprising at least 10, 12,
15, 18, 20, 23, 25, 27, 30, 35, 40, or 50 nucleotides from the EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids
sequence is hybridized to the single stranded DNA. Hybrids between the biotinylated oligonucleotide
and the single stranded DNA containing the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids are isolated as
described above. Thereafter, the single stranded DNA containing the EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids is released from the beads and converted into double stranded DNA using a primer specific
for the EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of
positional segments of EST-related nucleic acids or a primer corresponding to a sequence included in the
cloning vector. The resulting double stranded DNA is transformed into bacteria. cDNAs containing the
EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of positional
segments of EST-related nucleic acids are identified by colony PCR or colony hybridization.

Once the upstream genomic sequences have been cloned and sequenced as described above, 
prospective promoters and transcription start sites within the upstream sequences may be identified by 
comparing the sequences upstream of the EST-related nucleic acids, positional segments of EST-related 

nucleic acids or fragments of positional segments of EST-related nucleic acids with databases containing
known transcription start sites, transcription factor binding sites, or promoter sequences. 

In addition, promoters in the upstream sequences may be identified using promoter reporter 
vectors as described in Example 54. 

EXAMPLE 54

Identification of Promoters in Cloned Upstream Sequences
The genomic sequences upstream of the EST-related nucleic acids, positional segments of EST- 
related nucleic acids or fragments of positional segments of EST-related nucleic acids are cloned into a 
suitable promoter reporter vector, such as the pSEAP-Basic, pSEAP-Enhancer, pβ-gal-Basic,
pβ-gal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from Clontech. Briefly, each of these
promoter reporter vectors includes multiple cloning sites positioned upstream of a reporter gene encoding
a readily assayable protein such as secreted alkaline phosphatase, β-galactosidase, or green fluorescent
protein. The sequences upstream of the EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids are inserted into the 
cloning sites upstream of the reporter gene in both orientations and introduced into an appropriate host 
cell. The level of reporter protein is assayed and compared to the level obtained from a vector which 
lacks an insert in the cloning site. The presence of an elevated expression level in the vector containing
the insert with respect to the control vector indicates the presence of a promoter in the insert. If 
necessary, the upstream sequences can be cloned into vectors which contain an enhancer for augmenting 
transcription levels from weak promoter sequences. A significant level of expression above that 
observed with the vector lacking an insert indicates that a promoter sequence is present in the inserted 

upstream sequence.
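
The comparison with the vector lacking an insert can be expressed as a fold change. The sketch below is a minimal illustration; the function names, the replicate readings, and in particular the 2-fold cutoff are hypothetical assumptions, since the text does not state what level of expression is to be considered significant.

```python
from statistics import mean

def fold_over_empty_vector(insert_readings, empty_vector_readings):
    """Mean reporter level with the insert divided by the mean level without it."""
    control = mean(empty_vector_readings)
    if control <= 0:
        raise ValueError("empty vector readings must have a positive mean")
    return mean(insert_readings) / control

def suggests_promoter(insert_readings, empty_vector_readings, min_fold=2.0):
    """True when the insert raises reporter level by at least min_fold (assumed cutoff)."""
    return fold_over_empty_vector(insert_readings, empty_vector_readings) >= min_fold

# Hypothetical replicate readings for one orientation of the insert.
print(fold_over_empty_vector([30, 34, 32], [8, 8, 8]))
```

Because the insert is tested in both orientations, the same calculation would be run once per orientation, and a proper statistical test on the replicates is preferable to a fixed cutoff.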

Appropriate host cells for the promoter reporter vectors may be chosen based on the results of 
the above described determination of expression patterns of the EST-related nucleic acids, positional 
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids. 
For example, if the expression pattern analysis indicates that the mRNA corresponding to a particular 

EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of positional
segments of EST-related nucleic acids is expressed in fibroblasts, the promoter reporter vector may be 
introduced into a human fibroblast cell line. 

Promoter sequences within the upstream genomic DNA may be further defined by constructing
nested deletions in the upstream DNA using conventional techniques such as Exonuclease III digestion.
The resulting deletion fragments can be inserted into the promoter reporter vector to determine whether
the deletion has reduced or obliterated promoter activity. In this way, the boundaries of the promoters
may be defined. If desired, potential individual regulatory sites within the promoter may be identified
using site directed mutagenesis or linker scanning to obliterate potential transcription factor binding sites
within the promoter individually or in combination. The effects of these mutations on transcription
levels may be determined by inserting the mutations into the cloning sites in the promoter reporter
vectors.
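
The way a nested deletion series delimits a promoter boundary can be sketched as follows. The function name, the data, and the rule that a construct counts as active when it keeps at least half of the full-length signal above the empty vector are all hypothetical assumptions made for illustration.

```python
def promoter_boundary(constructs, empty_vector_activity, min_fraction=0.5):
    """Bracket the 5' boundary of the promoter from a nested deletion series.

    constructs: (bp of upstream sequence retained, reporter activity) pairs.
    Returns (longest construct that lost activity, shortest construct that kept it).
    """
    ordered = sorted(constructs, reverse=True)  # longest construct first
    full_signal = ordered[0][1] - empty_vector_activity
    if full_signal <= 0:
        raise ValueError("longest construct shows no activity above the empty vector")
    cutoff = min_fraction * full_signal
    active = [bp for bp, activity in ordered if activity - empty_vector_activity >= cutoff]
    shortest_active = min(active)
    # Every construct shorter than the shortest active one has lost activity.
    lost = [bp for bp, _activity in ordered if bp < shortest_active]
    return max(lost, default=0), shortest_active

series = [(1000, 100.0), (600, 95.0), (300, 90.0), (150, 12.0), (50, 10.0)]
print(promoter_boundary(series, empty_vector_activity=10.0))  # (150, 300)
```

In this toy series the boundary falls between 150 and 300 bp upstream, which is the interval that would then be examined by site directed mutagenesis or linker scanning.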

EXAMPLE 55 
Cloning and Identification of Promoters 
Using the method described in Example 54 above with 5' ESTs, sequences upstream of several
genes were obtained. Using the primer pairs GGG AAG ATG GAG ATA GTA TTG CCT G (SEQ ID
NO: 15) and CTG CCA TGT ACA TGA TAG AGA GAT TC (SEQ ID NO: 16), the promoter having
the internal designation P13H2 (SEQ ID NO: 17) was obtained.

Using the primer pairs GTA CCA GGGG ACT GTG ACC ATT GC (SEQ ID NO: 18) and CTG
TGA CCA TTG CTC CCA AGA GAG (SEQ ID NO: 19), the promoter having the internal designation
P15B4 (SEQ ID NO: 20) was obtained.

Using the primer pairs CTG GGA TOG AAG GCA CGG TA (SEQ ID NO: 21) and GAG ACC
ACA CAG CTA GAG AA (SEQ ID NO: 22), the promoter having the internal designation P29B6 (SEQ
ID NO: 23) was obtained.

Figure 4 provides a schematic description of the promoters isolated and the way they are
assembled with the corresponding 5' tags. The upstream sequences were screened for the presence of
motifs resembling transcription factor binding sites or known transcription start sites using the computer
program MatInspector release 2.0, August 1996.

Figure 5 describes the transcription factor binding sites present in each of these promoters. The
column labeled "matrice" provides the name of the MatInspector matrix used. The column labeled
"position" provides the 5' position of the promoter site. Numeration of the sequence starts from the
transcription start site as determined by matching the genomic sequence with the 5' EST sequence. The
column labeled "orientation" indicates the DNA strand on which the site is found, with the + strand
being the coding strand as determined by matching the genomic sequence with the sequence of the 5'
EST. The column labeled "score" provides the MatInspector score found for this site. The column
labeled "length" provides the length of the site in nucleotides. The column labeled "sequence" provides
the sequence of the site found.
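
The kind of table described for Figure 5 can be illustrated with a simple weight matrix scan over both strands. This sketch is not the MatInspector algorithm and does not reproduce its scoring; the matrix, the threshold, the 0-based positions, and the function names are hypothetical, and the score is simply the sum of matching weights divided by the best possible sum.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]

def scan(seq, matrix, threshold):
    """Return (position, orientation, score, length, sequence) for each hit.

    matrix: one dict per site position mapping a base to its weight.
    position: 0-based coordinate of the leftmost base of the site on the + strand.
    """
    seq = seq.upper()
    width = len(matrix)
    best = sum(max(column.values()) for column in matrix)
    hits = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for i in range(len(strand_seq) - width + 1):
            window = strand_seq[i:i + width]
            score = sum(col.get(base, 0.0) for col, base in zip(matrix, window)) / best
            if score >= threshold:
                position = i if strand == "+" else len(seq) - i - width
                hits.append((position, strand, round(score, 3), width, window))
    return hits

# Hypothetical consensus-only matrix for the motif TATAAA.
tata = [{"T": 1.0}, {"A": 1.0}, {"T": 1.0}, {"A": 1.0}, {"A": 1.0}, {"A": 1.0}]
print(scan("GGTATAAACC", tata, threshold=0.9))
```

To match the numbering convention of Figure 5, the reported positions would still have to be shifted so that counting starts from the transcription start site rather than from the first base of the scanned sequence.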

Bacterial clones containing plasmids containing the promoter sequences described above
are presently stored in the inventor's laboratories under the internal identification
numbers provided above. The inserts may be recovered from the deposited materials by growing an
aliquot of the appropriate bacterial clone in the appropriate medium. The plasmid DNA can then be
isolated using plasmid isolation procedures familiar to those skilled in the art such as alkaline lysis
minipreps or large scale alkaline lysis plasmid isolation procedures. If desired the plasmid DNA may be
further enriched by centrifugation on a cesium chloride gradient, size exclusion chromatography, or
anion exchange chromatography. The plasmid DNA obtained using these procedures may then be
manipulated using standard cloning techniques familiar to those skilled in the art. Alternatively, a PCR
can be done with primers designed at both ends of the inserted EST-related nucleic acids, positional
segments of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids.
The PCR product which corresponds to the EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids can then be
manipulated using standard cloning techniques familiar to those skilled in the art.

The promoters and other regulatory sequences located upstream of the EST-related nucleic 
acids, positional segments of EST-related nucleic acids or fragments of positional segments of EST- 
related nucleic acids may be used to design expression vectors capable of directing the expression of an 
inserted gene in a desired spatial, temporal, developmental, or quantitative manner. A promoter capable 

of directing the desired spatial, temporal, developmental, and quantitative patterns may be selected using
the results of the expression analysis described above. For example, if a promoter which confers a high
level of expression in muscle is desired, the promoter sequence upstream of EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids derived from an mRNA which is expressed at a high level in muscle, as determined by
the methods above, may be used in the expression vector.

Preferably, the desired promoter is placed near multiple restriction sites to facilitate the cloning
of the desired insert downstream of the promoter, such that the promoter is able to drive expression of
the inserted gene. The promoter may be inserted in conventional nucleic acid backbones designed for
extrachromosomal replication, integration into the host chromosomes or transient expression. Suitable
backbones for the present expression vectors include retroviral backbones, backbones from eukaryotic
episomes such as SV40 or Bovine Papilloma Virus, backbones from bacterial episomes, or artificial
chromosomes.

Preferably, the expression vectors also include a polyA signal downstream of the multiple 
restriction sites for directing the polyadenylation of mRNA transcribed from the gene inserted into the 
expression vector. 

Following the identification of promoter sequences, proteins which interact with the promoter 
may be identified as described in Example 56 below.

EXAMPLE 56 
Identification of Proteins Which Interact with 
Promoter Sequences, Upstream Reeulatorv Sequences, or mRNA 
20 Sequences within the promoter region which are likely to bind transcription factors may be 

identified by homology to known transcription factor binding sites or through conventional mutagenesis 
or deletion analyses of reporter plasmids containing the promoter sequence. For example, deletions may 
be made in a reporter plasmid containing the promoter sequence of interest operably linked to an 
assayable reporter gene. The reporter plasmids carrying various deletions within the promoter region are 
transfected into an appropriate host cell and the effects of the deletions on expression levels are assessed.
Transcription factor binding sites within the regions in which deletions reduce expression levels may be
further localized using site directed mutagenesis, linker scanning analysis, or other techniques familiar to
those skilled in the art. 

Nucleic acids encoding proteins which interact with sequences in the promoter may be identified 
using one-hybrid systems such as those described in the manual accompanying the Matchmaker One-
Hybrid System kit available from Clontech (Catalog No. K1603-1). Briefly, the Matchmaker One-
Hybrid System is used as follows. The target sequence for which it is desired to identify binding proteins
is cloned upstream of a selectable reporter gene and integrated into the yeast genome. Preferably, 
multiple copies of the target sequences are inserted into the reporter plasmid in tandem. A library 
comprised of fusions between cDNAs to be evaluated for the ability to bind to the promoter and the
activation domain of a yeast transcription factor, such as GAL4, is transformed into the yeast strain
containing the integrated reporter sequence. The yeast are plated on selective media to select cells
expressing the selectable marker linked to the promoter sequence. The colonies which grow on the
selective media contain genes encoding proteins which bind the target sequence. The inserts in the genes
encoding the fusion proteins are further characterized by sequencing. In addition, the inserts may be 
inserted into expression vectors or in vitro transcription vectors. Binding of the polypeptides encoded by 
the inserts to the promoter DNA may be confirmed by techniques familiar to those skilled in the art, such
as gel shift analysis or DNase protection analysis.

VIII. Use of EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids in Gene Therapy 

The present invention also comprises the use of EST-related nucleic acids, positional segments
of EST-related nucleic acids or fragments of positional segments of EST-related nucleic acids in gene
therapy strategies, including antisense and triple helix strategies as described in Examples 57 and 58 
below. In antisense approaches, nucleic acid sequences complementary to an mRNA are hybridized to 
the mRNA intracellularly, thereby blocking the expression of the protein encoded by the mRNA. The
antisense sequences may prevent gene expression through a variety of mechanisms. For example, the
antisense sequences may inhibit the ability of ribosomes to translate the mRNA. Alternatively, the 
antisense sequences may block transport of the mRNA from the nucleus to the cytoplasm, thereby 
limiting the amount of mRNA available for translation. Another mechanism through which antisense 
sequences may inhibit gene expression is by interfering with mRNA splicing. In yet another strategy,
the antisense nucleic acid may be incorporated in a ribozyme capable of specifically cleaving the target
mRNA. 

EXAMPLE 57 
Preparation and Use of Antisense Oligonucleotides 
The antisense nucleic acid molecules to be used in gene therapy may be either DNA or RNA
sequences. They may comprise a sequence complementary to the sequence of the EST-related nucleic
acids, positional segments of EST-related nucleic acids or fragments of positional segments of EST-
related nucleic acids. The antisense nucleic acids should have a length and melting temperature
sufficient to permit formation of an intracellular duplex with sufficient stability to inhibit the expression
of the mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for use in gene
therapy are disclosed in Green et al., Ann. Rev. Biochem. 55:569-597 (1986) and Izant and Weintraub,
Cell 36:1007-1015 (1984).
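The length and melting temperature consideration above can be illustrated with a minimal sketch. It derives a DNA antisense oligonucleotide as the reverse complement of an mRNA segment and estimates its melting temperature with the Wallace rule of thumb (2 degrees C per A or T, 4 degrees C per G or C). The Wallace rule is a rough approximation for short oligonucleotides and is introduced here only for illustration; it is not a method described in this specification. The target segment and the function names are hypothetical.

```python
# Illustrative sketch only: the sequence and function names are assumptions.
COMPLEMENT = str.maketrans("ACGTU", "TGCAA")

def antisense_dna(mrna_segment):
    """Return the DNA oligonucleotide (5'->3') complementary to an mRNA segment."""
    return mrna_segment.upper().translate(COMPLEMENT)[::-1]

def wallace_tm(oligo):
    """Rough melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc

target = "AUGGCCAAGUUCGAC"          # hypothetical mRNA segment
oligo = antisense_dna(target)
print(oligo, len(oligo), wallace_tm(oligo))
```

In practice the stability of an intracellular duplex depends on salt, temperature and backbone chemistry, so an estimate of this kind would only serve as a first filter before experimental testing.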

In some strategies, antisense molecules are obtained from a nucleotide sequence encoding a 
protein by reversing the orientation of the coding region with respect to a promoter so as to transcribe the 
opposite strand from that which is normally transcribed in the cell. The antisense molecules may be
transcribed using in vitro transcription systems such as those which employ T7 or SP6 polymerase to
generate the transcript. Another approach involves transcription of the antisense nucleic acids in vivo by 
operably linking DNA containing the antisense sequence to a promoter in an expression vector. 

Alternatively, oligonucleotides which are complementary to the strand normally transcribed in 
the cell may be synthesized in vitro. Thus, the antisense nucleic acids are complementary to the 
corresponding mRNA and are capable of hybridizing to the mRNA to create a duplex. In some
embodiments, the antisense sequences may contain modified sugar phosphate backbones to increase
stability and make them less sensitive to RNase activity. Examples of modifications suitable for use in
antisense strategies are described by Rossi et al., Pharmacol. Ther. 50(2):245-254 (1991).

Various types of antisense oligonucleotides complementary to the sequence of the EST-related
nucleic acids, positional segments of EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids may be used. In one preferred embodiment, stable and semi-stable antisense
oligonucleotides described in International Application No. PCT WO94/23026 are used. In these
molecules, the 3' end or both the 3' and 5' ends are engaged in intramolecular hydrogen bonding
between complementary base pairs. These molecules are better able to withstand exonuclease attacks
and exhibit increased stability compared to conventional antisense oligonucleotides.

In another preferred embodiment, the antisense oligodeoxynucleotides against herpes simplex 
virus types 1 and 2 described in International Application No. WO 95/04141 are used.

In yet another preferred embodiment, the covalently cross-linked antisense oligonucleotides 
described in International Application No. WO 96/31523 are used. These double- or single-stranded
oligonucleotides comprise one or more, respectively, inter- or intra-oligonucleotide covalent cross-
linkages, wherein the linkage consists of an amide bond between a primary amine group of one strand
and a carboxyl group of the other strand or of the same strand, respectively, the primary amine group
being directly substituted in the 2' position of the strand nucleotide monosaccharide ring, and the
carboxyl group being carried by an aliphatic spacer group substituted on a nucleotide or nucleotide
analog of the other strand or the same strand, respectively.

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International Application 
No. WO 92/18522 may also be used. These molecules are stable to degradation and contain at least one 
transcription control recognition sequence which binds to control proteins and are effective as decoys 
therefor. These molecules may contain "hairpin" structures, "dumbbell" structures, "modified
dumbbell" structures, "cross-linked" decoy structures and "loop" structures.

In another preferred embodiment, the cyclic double-stranded oligonucleotides described in
European Patent Application No. 0 572 287 A2 are used. These ligated oligonucleotide "dumbbells" contain the
binding site for a transcription factor and inhibit expression of the gene under control of the transcription 
factor by sequestering the factor. 

Use of the closed antisense oligonucleotides disclosed in International Application No. WO
92/19732 is also contemplated. Because these molecules have no free ends, they are more resistant to
degradation by exonucleases than are conventional oligonucleotides. These oligonucleotides may be
multifunctional, interacting with several regions which are not adjacent to the target mRNA. 

The appropriate level of antisense nucleic acids required to inhibit gene expression may be 
determined using in vitro expression analysis. The antisense molecule may be introduced into the cells 
by diffusion, injection, infection or transfection using procedures known in the art. For example, the
antisense nucleic acids can be introduced into the body as a bare or naked oligonucleotide, 
oligonucleotide encapsulated in lipid, oligonucleotide sequence encapsidated by viral protein, or as an 
oligonucleotide operably linked to a promoter contained in an expression vector. The expression vector 
may be any of a variety of expression vectors known in the art, including retroviral or viral vectors, 
vectors capable of extrachromosomal replication, or integrating vectors. The vectors may be DNA or
RNA. 

The antisense molecules are introduced onto cell samples at a number of different
concentrations, preferably between 1 x 10^-10 M and 1 x 10^-4 M. Once the minimum concentration that can
adequately control gene expression is identified, the optimized dose is translated into a dosage suitable
for use in vivo. For example, an inhibiting concentration in culture of 1 x 10^-7 M translates into a dose of
approximately 0.6 mg/kg bodyweight. Levels of oligonucleotide approaching 100 mg/kg bodyweight or
higher may be possible after testing the toxicity of the oligonucleotide in laboratory animals. It is
additionally contemplated that cells from the vertebrate are removed, treated with the antisense 
oligonucleotide, and reintroduced into the vertebrate. 
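The translation of an inhibiting concentration in culture into an in vivo dose can be followed with simple arithmetic. The sketch below assumes an oligonucleotide with a molecular weight of about 6,000 g/mol (roughly a 20-mer) and a distribution volume of about 1 liter per kg of bodyweight. Both figures are illustrative assumptions, not values stated in this specification; they are chosen because, for a concentration of 1 x 10^-7 M, they give a dose of about 0.6 mg/kg.

```python
def dose_mg_per_kg(molar_conc, mol_weight_g_per_mol=6000.0, liters_per_kg=1.0):
    """Convert a molar concentration into an approximate dose in mg per kg.

    mol_weight_g_per_mol and liters_per_kg are assumed, illustrative values.
    """
    grams_per_liter = molar_conc * mol_weight_g_per_mol
    return grams_per_liter * 1000.0 * liters_per_kg   # g/L -> mg/L -> mg/kg

print(round(dose_mg_per_kg(1e-7), 3))
```

Under the same assumptions, the conversion is linear, so a tenfold higher inhibiting concentration would correspond to a tenfold higher dose.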

It is further contemplated that the antisense oligonucleotide sequence is incorporated into a
ribozyme sequence to enable the antisense to specifically bind and cleave its target mRNA. For
technical applications of ribozyme and antisense oligonucleotides see Rossi et al., supra.

In a preferred application of this invention, the polypeptide encoded by the gene is first 
identified, so that the effectiveness of antisense inhibition on translation can be monitored using
techniques that include but are not limited to antibody-mediated tests such as RIAs and ELISA,
functional assays, or radiolabeling. 

The EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of 
positional segments of EST-related nucleic acids may also be used in gene therapy approaches based on 
intracellular triple helix formation. Triple helix oligonucleotides are used to inhibit transcription from a
genome. They are particularly useful for studying alterations in cell activity as it is associated with a
particular gene. The EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids of the present invention or, more
preferably, a portion of those sequences, can be used to inhibit gene expression in individuals having
diseases associated with expression of a particular gene. Similarly, the EST-related nucleic acids,
positional segments of EST-related nucleic acids or fragments of positional segments of EST-related
nucleic acids can be used to study the effect of inhibiting transcription of a particular gene within a cell.
Traditionally, homopurine sequences were considered the most useful for triple helix strategies.
However, homopyrimidine sequences can also inhibit gene expression. Such homopyrimidine
oligonucleotides bind to the major groove at homopurine:homopyrimidine sequences. Thus, both types
of sequences from the EST-related nucleic acids, positional segments of EST-related nucleic acids or
fragments of positional segments of EST-related nucleic acids are contemplated within the scope of this
invention.

EXAMPLE 58 

Preparation and use of Triple Helix Probes 
The sequences of the EST-related nucleic acids, positional segments of EST-related nucleic
acids or fragments of positional segments of EST-related nucleic acids are scanned to identify 10-mer to
20-mer homopyrimidine or homopurine stretches which could be used in triple-helix based strategies for
inhibiting gene expression. Following identification of candidate homopyrimidine or homopurine
stretches, their efficiency in inhibiting gene expression is assessed by introducing varying amounts of
oligonucleotides containing the candidate sequences into tissue culture cells which normally express the
target gene. The oligonucleotides may be prepared on an oligonucleotide synthesizer or they may be
purchased commercially from a company specializing in custom oligonucleotide synthesis, such as 
GENSET, Paris, France. 
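The scanning step described above can be sketched in a few lines. The example below reports every maximal run of purines (A, G) or pyrimidines (C, T) of at least 10 bases in a DNA sequence. Runs longer than 20 bases are returned whole, on the assumption that any 10-mer to 20-mer window within them could then be chosen. The function name and the test sequence are hypothetical.

```python
import re

# A maximal run of purines (A/G) or of pyrimidines (C/T) in a DNA sequence.
RUN = re.compile(r"[AG]+|[CT]+")

def candidate_stretches(seq, min_len=10):
    """Return (start, kind, run) for each homopurine or homopyrimidine run
    of at least min_len bases; start is a 0-based position."""
    hits = []
    for match in RUN.finditer(seq.upper()):
        run = match.group()
        if len(run) >= min_len:
            kind = "homopurine" if run[0] in "AG" else "homopyrimidine"
            hits.append((match.start(), kind, run))
    return hits

# Hypothetical sequence containing one 12-base purine run and one
# 14-base pyrimidine run.
example = "ACGT" + "AGGAGAAGGAGA" + "TCA" + "CTTCCTCTTCCTCT" + "GA"
for start, kind, run in candidate_stretches(example):
    print(start, kind, len(run), run)
```

Ambiguity codes such as N match neither character class, so they simply terminate a run.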

The oligonucleotides may be introduced into the cells using a variety of methods known to those 
skilled in the art, including but not limited to calcium phosphate precipitation, DEAE-Dextran,
electroporation, liposome-mediated transfection or native uptake.

Treated cells are monitored for altered cell function or reduced gene expression using techniques 
such as Northern blotting, RNase protection assays, or PCR based strategies to monitor the transcription 
levels of the target gene in cells which have been treated with the oligonucleotide. The cell functions to 
be monitored are predicted based upon the homologies of the target genes corresponding to the EST-
related nucleic acids, positional segments of EST-related nucleic acids or fragments of positional
segments of EST-related nucleic acids from which the oligonucleotides were derived with known gene
sequences that have been associated with a particular function. The cell functions can also be predicted
based on the presence of abnormal physiologies within cells derived from individuals with a particular
inherited disease, particularly when the EST-related nucleic acids, positional segments of EST-related
nucleic acids or fragments of positional segments of EST-related nucleic acids are associated with the
disease using techniques described herein.

The oligonucleotides which are effective in inhibiting gene expression in tissue culture cells 
may then be introduced in vivo using the techniques described above and in Example 56 at a dosage 
calculated based on the in vitro results, as described in Example 57. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be replaced
with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an intercalating
agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha oligonucleotide to
stabilize the triple helix. For information on the generation of oligonucleotides suitable for triple helix
formation see Griffin et al. (Science 245:967-971 (1989)).



EXAMPLE 59 

Use of EST-related nucleic acids, positional segments of
EST-related nucleic acids or fragments of positional segments of
EST-related nucleic acids to express an Encoded Protein in a Host Organism
The EST-related nucleic acids, positional segments of EST-related nucleic acids or fragments of 
positional segments of EST-related nucleic acids may also be used to express an encoded protein or 
polypeptide in a host organism to produce a beneficial effect. In addition, nucleic acids encoding the
EST-related polypeptides, positional segments of EST-related polypeptides or fragments of positional
segments of EST-related polypeptides may be used to express the encoded protein or polypeptide in a 
host organism to produce a beneficial effect. 

In such procedures, the encoded protein or polypeptide may be transiently expressed in the host 
organism or stably expressed in the host organism. The encoded protein or polypeptide may have any of
the activities described above. The encoded protein or polypeptide may be a protein or polypeptide 
which the host organism lacks or, alternatively, the encoded protein may augment the existing levels of 
the protein in the host organism. 

In some embodiments in which the protein or polypeptide is secreted, nucleic acids encoding the 
full length protein (i.e. the signal peptide and the mature protein), or nucleic acids encoding only the
mature protein (i.e. the protein generated when the signal peptide is cleaved off) are introduced into the
host organism. 

The nucleic acids encoding the proteins or polypeptides may be introduced into the host 
organism using a variety of techniques known to those of skill in the art. For example, the extended
cDNA may be injected into the host organism as naked DNA such that the encoded protein is expressed
in the host organism, thereby producing a beneficial effect. 

Alternatively, the nucleic acids encoding the protein or polypeptide may be cloned into an 
expression vector downstream of a promoter which is active in the host organism. The expression vector 
may be any of the expression vectors designed for use in gene therapy, including viral or retroviral
vectors. The expression vector may be directly introduced into the host organism such that the encoded
protein is expressed in the host organism to produce a beneficial effect. In another approach, the 
expression vector may be introduced into cells in vitro. Cells containing the expression vector are 
thereafter selected and introduced into the host organism, where they express the encoded protein or 
polypeptide to produce a beneficial effect. 




EXAMPLE 60 

Use of Signal Peptides To Import Proteins Into Cells
The short core hydrophobic region (h) of signal peptides encoded by the sequences of SEQ ID
NOs. 24-728 and 766-792 may also be used as a carrier to import a peptide or a protein of interest, so-
called cargo, into tissue culture cells (Lin et al., J. Biol. Chem., 270: 14225-14258 (1995); Du et al., J.
Peptide Res., 51: 235-243 (1998); Rojas et al., Nature Biotech., 16: 370-375 (1998)).

When cell permeable peptides of limited size (approximately up to 25 amino acids) are to be
translocated across the cell membrane, chemical synthesis may be used in order to add the h region to either
the C-terminus or the N-terminus of the cargo peptide of interest. Alternatively, when longer peptides or
proteins are to be imported into cells, nucleic acids can be genetically engineered, using techniques
familiar to those skilled in the art, in order to link the extended cDNA sequence encoding the h region to
the 5' or the 3' end of a DNA sequence coding for a cargo polypeptide. Such genetically engineered
nucleic acids are then translated either in vitro or in vivo after transfection into appropriate cells, using
conventional techniques to produce the resulting cell permeable polypeptide. Suitable host cells are
then simply incubated with the cell permeable polypeptide which is then translocated across the
membrane.

This method may be applied to study diverse intracellular functions and cellular processes. For
instance, it has been used to probe functionally relevant domains of intracellular proteins and to examine
protein-protein interactions involved in signal transduction pathways (Lin et al., supra; Lin et al., J. Biol.
Chem., 271: 5305-5308 (1996); Rojas et al., J. Biol. Chem., 271: 27456-27461 (1996); Liu et al., Proc.
Natl. Acad. Sci. USA, 93: 11819-11824 (1996); Rojas et al., Biochem. Biophys. Res. Commun., 234: 675-
680 (1997)).

Such techniques may be used in cellular therapy to import proteins producing therapeutic 
effects. For instance, cells isolated from a patient may be treated with imported therapeutic proteins and 
then re-introduced into the host organism. 

Alternatively, the h region of signal peptides of the present invention could be used in 
combination with a nuclear localization signal to deliver nucleic acids into the cell nucleus. Such
oligonucleotides may be antisense oligonucleotides or oligonucleotides designed to form triple helixes,
as described above, in order to inhibit processing and maturation of a target cellular RNA.

EXAMPLE 61 

Computer Embodiments

As used herein the term "nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622"
encompasses the nucleotide sequences of SEQ ID NOs. 24-811 and 1600-1622, fragments of SEQ ID
NOs. 24-811 and 1600-1622, nucleotide sequences homologous to SEQ ID NOs. 24-811 and 1600-
1622 or homologous to fragments of SEQ ID NOs. 24-811 and 1600-1622, and sequences
complementary to all of the preceding sequences. The fragments include portions of SEQ ID NOs. 24-
811 and 1600-1622 comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500
consecutive nucleotides of SEQ ID NOs. 24-811 and 1600-1622. Preferably, the fragments are novel
fragments. Preferably the fragments include polynucleotides described in Table II, polynucleotides
described in Table III, polynucleotides described in Table IV or portions thereof comprising at least 10,
15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the
polynucleotides described in Tables II, III, or IV. Homologous sequences and fragments of SEQ ID
NOs. 24-811 and 1600-1622 refer to a sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 85%,
80%, or 75% homology to these sequences. Homology may be determined using any of the computer
programs and parameters described in Example 18, including BLAST2N with the default parameters or
with any modified parameters. Homologous sequences also include RNA sequences in which uridines
replace the thymines in the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622. The
homologous sequences may be obtained using any of the procedures described herein or may result from
the correction of a sequencing error as described above. Preferably the homologous sequences and
fragments of SEQ ID NOs. 24-811 and 1600-1622 include polynucleotides described in Table II,
polynucleotides described in Table III, polynucleotides described in Table IV or portions thereof
comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive
nucleotides of the polynucleotides described in Tables II, III, or IV. It will be appreciated that the
nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 can be represented in the traditional single
character format (See the inside back cover of Stryer, Lubert. Biochemistry, 3rd edition. W. H. Freeman &
Co., New York.) or in any other format which records the identity of the nucleotides in a sequence.
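Two of the derived forms named above, the RNA sequence in which uridines replace thymines and the fragments of a given number of consecutive nucleotides, can be sketched on a sequence held in single character format. The sequence shown is hypothetical and is not one of the SEQ ID NOs.; the function names are assumptions made for illustration.

```python
def to_rna(dna):
    """Return the RNA form of a DNA sequence, with U replacing T."""
    return dna.upper().replace("T", "U")

def fragments(seq, length):
    """Return every fragment of `length` consecutive nucleotides of seq."""
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]

example = "ATGGCCAAGTTCGAC"       # hypothetical 15-nucleotide sequence
print(to_rna(example))
print(len(fragments(example, 10)), fragments(example, 10)[0])
```

A sequence shorter than the requested length yields an empty list, which matches the idea that a fragment must comprise at least the stated number of consecutive nucleotides.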

As used herein the term "polypeptide codes of SEQ ID NOS. 812-1599" encompasses the
polypeptide sequences of SEQ ID NOs. 812-1599 which are encoded by the 5' ESTs of SEQ ID NOs.
24-811 and 1600-1622, polypeptide sequences homologous to the polypeptides of SEQ ID NOS. 812-
1599, or fragments of any of the preceding sequences. Homologous polypeptide sequences refer to a
polypeptide sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, or 75% homology to
one of the polypeptide sequences of SEQ ID NOS. 812-1599. Homology may be determined using any
of the computer programs and parameters described herein, including FASTA with the default
parameters or with any modified parameters. The homologous sequences may be obtained using any of
the procedures described herein or may result from the correction of a sequencing error as described
above. The polypeptide fragments comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150
consecutive amino acids of the polypeptides of SEQ ID NOS. 812-1599. Preferably, the fragments are
novel fragments. Preferably, the fragments include polypeptides encoded by the polynucleotides
described in Table II, or portions thereof comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or
150 consecutive amino acids of the polypeptides encoded by the polynucleotides described in Table II.
It will be appreciated that the polypeptide codes of the SEQ ID NOS. 812-1599 can be represented in
the traditional single character format or three letter format (See the inside back cover of Stryer, Lubert.
Biochemistry, 3rd edition. W. H. Freeman & Co., New York.) or in any other format which relates the
identity of the polypeptides in a sequence.

It will be appreciated by those skilled in the art that the nucleic acid codes of SEQ ID NOs. 24-
811 and 1600-1622 and polypeptide codes of SEQ ID NOS. 812-1599 can be stored, recorded, and
manipulated on any medium which can be read and accessed by a computer. As used herein, the words
"recorded" and "stored" refer to a process for storing information on a computer medium. A skilled
artisan can readily adopt any of the presently known methods for recording information on a computer
readable medium to generate manufactures comprising one or more of the nucleic acid codes of SEQ ID
NOs. 24-811 and 1600-1622, or one or more of the polypeptide codes of SEQ ID NOS. 812-1599.
Another aspect of the present invention is a computer readable medium having recorded thereon at least
2, 5, 10, 15, 20, 25, 30, or 50 nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622. Another
aspect of the present invention is a computer readable medium having recorded thereon at least 2, 5, 10,
15, 20, 25, 30, or 50 polypeptide codes of SEQ ID NOS. 812-1599.

Computer readable media include magnetically readable media, optically readable media, 
electronically readable media and magnetic/optical media. For example, the computer readable media
may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), Random
Access Memory (RAM), or Read Only Memory (ROM) as well as other types of media known to
those skilled in the art.
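As one possible illustration of recording sequence codes on a computer readable medium, the sketch below writes named sequences in a FASTA-style text layout (a ">" header line followed by sequence lines) and reads them back. The choice of this layout is an assumption made for illustration; the specification does not prescribe a storage format. The record names and sequences are hypothetical, and an in-memory buffer stands in for a file on disk.

```python
import io

def write_fasta(records, handle, width=60):
    """Write (name, sequence) pairs as FASTA-style text, wrapping at width."""
    for name, seq in records:
        handle.write(">" + name + "\n")
        for i in range(0, len(seq), width):
            handle.write(seq[i:i + width] + "\n")

def read_fasta(handle):
    """Read FASTA-style text back into a list of (name, sequence) pairs."""
    records, name, chunks = [], None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records

stored = [("hypothetical_nucleic_1", "ACGTACGTAC"), ("hypothetical_peptide_1", "MKT")]
buffer = io.StringIO()              # stands in for a file on any medium
write_fasta(stored, buffer, width=4)
buffer.seek(0)
print(read_fasta(buffer) == stored)
```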

Embodiments of the present invention include systems, particularly computer systems which
store and manipulate the sequence information described herein. One example of a computer system
100 is illustrated in block diagram form in Figure 6. As used herein, "a computer system" refers to the
hardware components, software components, and data storage components used to analyze the
nucleotide sequences of the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622, or the amino
acid sequences of the polypeptide codes of SEQ ID NOS. 812-1599. In one embodiment, the computer
system 100 is a Sun Enterprise 1000 server (Sun Microsystems, Palo Alto, CA). The computer system
100 preferably includes a processor for processing, accessing and manipulating the sequence data. The
processor 105 can be any well-known type of central processing unit, such as the Pentium III from Intel
Corporation, or similar processor from Sun, Motorola, Compaq or International Business Machines.

Preferably, the computer system 100 is a general purpose system that comprises the processor
105 and one or more internal data storage components 110 for storing data, and one or more data
retrieving devices for retrieving the data stored on the data storage components. A skilled artisan can
readily appreciate that any one of the currently available computer systems is suitable.

In one particular embodiment, the computer system 100 includes a processor 105 connected to a
bus which is connected to a main memory 115 (preferably implemented as RAM) and one or more
internal data storage devices 110, such as a hard drive and/or other computer readable media having data
recorded thereon. In some embodiments, the computer system 100 further includes one or more data
retrieving devices 118 for reading the data stored on the internal data storage devices 110.

The data retrieving device 118 may represent, for example, a floppy disk drive, a compact disk
drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 110 is a
removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, etc.
containing control logic and/or data recorded thereon. The computer system 100 may advantageously
include or be programmed by appropriate software for reading the control logic and/or the data from the
data storage component once inserted in the data retrieving device.

The computer system 100 includes a display 120 which is used to display output to a computer
user. It should also be noted that the computer system 100 can be linked to other computer systems
125a-c in a network or wide area network to provide centralized access to the computer system 100. 

Software for accessing and processing the nucleotide sequences of the nucleic acid codes of
SEQ ID NOs. 24-811 and 1600-1622, or the amino acid sequences of the polypeptide codes of SEQ ID
NOS. 812-1599 (such as search tools, compare tools, and modeling tools etc.) may reside in main
memory 115 during execution.

In some embodiments, the computer system 100 may further comprise a sequence comparer for
comparing the above-described nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 or
polypeptide codes of SEQ ID NOS. 812-1599 stored on a computer readable medium to reference
nucleotide or polypeptide sequences stored on a computer readable medium. A "sequence comparer"
refers to one or more programs which are implemented on the computer system 100 to compare a
nucleotide or polypeptide sequence with other nucleotide or polypeptide sequences and/or compounds
including but not limited to peptides, peptidomimetics, and chemicals stored within the data storage
means. For example, the sequence comparer may compare the nucleotide sequences of the nucleic acid
codes of SEQ ID NOs. 24-811 and 1600-1622, or the amino acid sequences of the polypeptide codes of
SEQ ID NOS. 812-1599 stored on a computer readable medium to reference sequences stored on a
computer readable medium to identify homologies, motifs implicated in biological function, or structural
motifs. The various sequence comparer programs identified elsewhere in this patent specification are
particularly contemplated for use in this aspect of the invention.

Figure 7 is a flow diagram illustrating one embodiment of a process 200 for comparing a new
nucleotide or protein sequence with a database of sequences in order to determine the homology levels
between the new sequence and the sequences in the database. The database of sequences can be a
private database stored within the computer system 100, or a public database such as GENBANK, PIR
or SWISSPROT that is available through the Internet.

The process 200 begins at a start state 201 and then moves to a state 202 wherein the new
sequence to be compared is stored to a memory in a computer system 100. As discussed above, the
memory could be any type of memory, including RAM or an internal storage device.

The process 200 then moves to a state 204 wherein a database of sequences is opened for
analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence stored in
the database is read into a memory on the computer. A comparison is then performed at a state 210 to
determine if the first sequence is the same as the new sequence. It is important to note that this step is
not limited to performing an exact comparison between the new sequence and the first sequence in the
database. Methods for comparing two nucleotide or protein sequences, even if they are not identical,
are well known to those of skill in the art. For example, gaps can be introduced into one sequence
in order to raise the homology level between the two tested sequences. The parameters that control
whether gaps or other features are introduced into a sequence during comparison are normally entered by
the user of the computer system.

Once a comparison of the two sequences has been performed at the state 210, a determination is
made at a decision state 212 whether the two sequences are the same. Of course, the term "same" is not
limited to sequences that are absolutely identical. Sequences that are within the homology parameters
entered by the user will be marked as "same" in the process 200.

If a determination is made that the two sequences are the same, the process 200 moves to a state
214 wherein the name of the sequence from the database is displayed to the user. This state notifies the
user that the sequence with the displayed name fulfills the homology constraints that were entered. Once
the name of the stored sequence is displayed to the user, the process 200 moves to a decision state 218
wherein a determination is made whether more sequences exist in the database. If no more sequences
exist in the database, then the process 200 terminates at an end state 220. However, if more sequences
do exist in the database, then the process 200 moves to a state 224 wherein a pointer is moved to the next
sequence in the database so that it can be compared to the new sequence. In this manner, the new
sequence is aligned and compared with every sequence in the database.

It should be noted that if a determination had been made at the decision state 212 that the
sequences were not homologous, then the process 200 would move immediately to the decision state 218
in order to determine if any other sequences were available in the database for comparison.
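
The loop of process 200 can be sketched in a few lines of Python. This is only an illustrative sketch, not the implementation contemplated by the specification: the function names, the example sequences, and the simple position-by-position scoring function are assumptions, and a real sequence comparer (for example BLAST or FASTA) would introduce gaps according to user-entered parameters.

```python
from typing import Dict, List


def percent_identity(a: str, b: str) -> float:
    """Hypothetical stand-in for a real alignment program.

    Counts identical characters position by position (no gaps) and
    expresses the count as a percentage of the longer sequence.
    """
    if not a or not b:
        return 0.0
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / max(len(a), len(b))


def search_database(new_seq: str, database: Dict[str, str],
                    min_identity: float = 90.0) -> List[str]:
    """Return the names of stored sequences that count as "same" as new_seq."""
    hits = []
    for name, stored_seq in database.items():           # states 206 and 224
        score = percent_identity(new_seq, stored_seq)    # state 210
        if score >= min_identity:                        # decision state 212
            hits.append(name)                            # state 214
    return hits                                          # end state 220


# Made-up example data; none of these are sequences from the Sequence Listing.
database = {
    "seq_exact": "ATGGCCATTGTAATGGGCCGC",
    "seq_one_mismatch": "ATGGCCATTGTAATGGGCCGA",
    "seq_unrelated": "T" * 21,
}
hits = search_database("ATGGCCATTGTAATGGGCCGC", database, min_identity=90.0)
print(hits)
```

The threshold `min_identity` plays the role of the homology parameters entered by the user, so "same" here means "at or above the threshold" rather than "identical".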

Accordingly, one aspect of the present invention is a computer system comprising a
processor, a data storage device having stored thereon a nucleic acid code of SEQ ID NOs. 24-811
and 1600-1622 or a polypeptide code of SEQ ID NOs. 812-1599, a data storage device having
retrievably stored thereon reference nucleotide sequences or polypeptide sequences to be compared
to the nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622 or polypeptide code of SEQ ID
NOs. 812-1599, and a sequence comparer for conducting the comparison. The sequence comparer
may indicate a homology level between the sequences compared or identify structural motifs in the
above described nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 and polypeptide codes of
SEQ ID NOs. 812-1599, or it may identify structural motifs in sequences which are compared to
these nucleic acid codes and polypeptide codes. In some embodiments, the data storage device may
have stored thereon the sequences of at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of
SEQ ID NOs. 24-811 and 1600-1622 or polypeptide codes of SEQ ID NOs. 812-1599.

Another aspect of the present invention is a method for determining the level of homology
between a nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622 and a reference nucleotide
sequence, comprising the steps of reading the nucleic acid code and the reference nucleotide sequence
through the use of a computer program which determines homology levels and determining homology
between the nucleic acid code and the reference nucleotide sequence with the computer program. The
computer program may be any of a number of computer programs for determining homology levels,
including those specifically enumerated herein, including BLAST2N with the default parameters or with
any modified parameters. The method may be implemented using the computer systems described
above. The method may also be performed by reading 2, 5, 10, 15, 20, 25, 30, or 50 of the above
described nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 through use of the computer
program and determining homology between the nucleic acid codes and reference nucleotide
sequences.

Figure 8 is a flow diagram illustrating one embodiment of a process 250 in a computer for
determining whether two sequences are homologous. The process 250 begins at a start state 252 and
then moves to a state 254 wherein a first sequence to be compared is stored to a memory. The
second sequence to be compared is then stored to a memory at a state 256. The process 250 then
moves to a state 260 wherein the first character in the first sequence is read and then to a state 262
wherein the first character of the second sequence is read. It should be understood that if the
sequence is a nucleotide sequence, then the character would normally be either A, T, C, G or U. If
the sequence is a protein sequence, then it should be in the single letter amino acid code so that the
first and second sequences can be easily compared.

A determination is then made at a decision state 264 whether the two characters are the same.
If they are the same, then the process 250 moves to a state 268 wherein the next characters in the first
and second sequences are read. A determination is then made whether the next characters are the
same. If they are, then the process 250 continues this loop until two characters are not the same. If a
determination is made that the next two characters are not the same, the process 250 moves to a
decision state 274 to determine whether there are any more characters in either sequence to read.

If there aren't any more characters to read, then the process 250 moves to a state 276 wherein
the level of homology between the first and second sequences is displayed to the user. The level of
homology is determined by calculating the proportion of characters between the sequences that were
the same out of the total number of characters in the first sequence. Thus, if every character in a first
100 nucleotide sequence aligned with every character in a second sequence, the homology level
would be 100%.
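
The homology calculation of process 250 can be illustrated with a minimal Python sketch. It assumes the two sequences are compared character by character at the same positions, without gaps, and that the result is expressed relative to the length of the first sequence; the function name `homology_level` is hypothetical.

```python
def homology_level(first: str, second: str) -> float:
    """Percentage of characters in `first` that match `second` at the same position.

    Simplified sketch of process 250: no gaps are introduced, and positions
    of `first` that lie beyond the end of `second` count as mismatches.
    """
    if not first:
        raise ValueError("first sequence must not be empty")
    first, second = first.upper(), second.upper()
    matches = 0
    for i, ch in enumerate(first):                   # states 260 through 268
        if i < len(second) and second[i] == ch:      # decision state 264
            matches += 1
    return 100.0 * matches / len(first)              # value displayed at state 276


print(homology_level("ACGT", "ACGT"))  # every character matches
print(homology_level("ACGT", "ACGA"))  # three of four characters match
```

Dividing by the length of the first sequence follows the definition given in the text; other conventions (for example dividing by the alignment length) would give different values for sequences of unequal length.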

Alternatively, the computer program may be a computer program which compares the
nucleotide sequences of the nucleic acid codes of the present invention to reference nucleotide
sequences in order to determine whether the nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622
differs from a reference nucleic acid sequence at one or more positions. Optionally such a program
records the length and identity of inserted, deleted or substituted nucleotides with respect to the sequence
of either the reference polynucleotide or the nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622.
In one embodiment, the computer program may be a program which determines whether the nucleotide
sequences of the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 contain a biallelic marker
or single nucleotide polymorphism (SNP) with respect to a reference nucleotide sequence. This single
nucleotide polymorphism may comprise a single base substitution, insertion, or deletion, while this
biallelic marker may comprise about one to ten consecutive bases substituted, inserted or deleted.

Another aspect of the present invention is a method for determining the level of homology
between a polypeptide code of SEQ ID NOs. 812-1599 and a reference polypeptide sequence,
comprising the steps of reading the polypeptide code of SEQ ID NOs. 812-1599 and the reference
polypeptide sequence through use of a computer program which determines homology levels and
determining homology between the polypeptide code and the reference polypeptide sequence using the
computer program.

Accordingly, another aspect of the present invention is a method for determining whether a
nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622 differs at one or more nucleotides from a
reference nucleotide sequence comprising the steps of reading the nucleic acid code and the reference
nucleotide sequence through use of a computer program which identifies differences between nucleic
acid sequences and identifying differences between the nucleic acid code and the reference nucleotide
sequence with the computer program. In some embodiments, the computer program is a program which
identifies single nucleotide polymorphisms. The method may be implemented by the computer systems
described above and the method illustrated in Figure 8. The method may also be performed by reading at
least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 and
the reference nucleotide sequences through the use of the computer program and identifying differences
between the nucleic acid codes and the reference nucleotide sequences with the computer program.
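
A program that records the length and identity of substituted, inserted, or deleted nucleotides might look like the following Python sketch. It assumes the two sequences have already been aligned by some other program and that gaps are written as "-"; the names `Difference`, `find_differences`, and `is_single_nucleotide` are hypothetical and the example sequences are made up.

```python
from typing import List, NamedTuple


class Difference(NamedTuple):
    kind: str         # "substitution", "insertion", or "deletion"
    ref_pos: int      # 1-based reference position (for insertions: base before the insert)
    ref_bases: str    # reference bases involved ("" for an insertion)
    query_bases: str  # query bases involved ("" for a deletion)


def find_differences(ref_aln: str, query_aln: str) -> List[Difference]:
    """List differences between two pre-aligned sequences (gaps written as '-')."""
    if len(ref_aln) != len(query_aln):
        raise ValueError("aligned sequences must have equal length")
    diffs: List[Difference] = []
    run = None     # open run of adjacent columns: [kind, ref_pos, ref_bases, query_bases]
    ref_pos = 0    # number of reference bases consumed so far
    for r, q in zip(ref_aln.upper(), query_aln.upper()):
        if r != "-":
            ref_pos += 1
        if r == q:
            kind = None
        elif r == "-":
            kind = "insertion"
        elif q == "-":
            kind = "deletion"
        else:
            kind = "substitution"
        if run is not None and kind != run[0]:
            diffs.append(Difference(*run))   # close the previous run
            run = None
        if kind is None:
            continue
        if run is None:
            run = [kind, ref_pos, "", ""]
        if r != "-":
            run[2] += r
        if q != "-":
            run[3] += q
    if run is not None:
        diffs.append(Difference(*run))
    return diffs


def is_single_nucleotide(d: Difference) -> bool:
    """True when the difference involves exactly one base."""
    return max(len(d.ref_bases), len(d.query_bases)) == 1


diffs = find_differences("ACGTAC--GT", "ACCTACTTG-")
for d in diffs:
    print(d, is_single_nucleotide(d))
```

Adjacent columns of the same kind are merged into one record, so the length of each event can be read from the length of `ref_bases` or `query_bases`.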

In other embodiments the computer based system may further comprise an identifier for
identifying features within the nucleotide sequences of the nucleic acid codes of SEQ ID NOs. 24-811
and 1600-1622 or the amino acid sequences of the polypeptide codes of SEQ ID NOs. 812-1599.

An "identifier" refers to one or more programs which identify certain features within the
above-described nucleotide sequences of the nucleic acid codes of SEQ ID NOs. 24-811 and
1600-1622 or the amino acid sequences of the polypeptide codes of SEQ ID NOs. 812-1599. In one
embodiment, the identifier may comprise a program which identifies an open reading frame in the
cDNA codes of SEQ ID NOs. 24-811 and 1600-1622.
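
As an illustration of an identifier that finds open reading frames, the following Python sketch scans the three forward reading frames for an ATG followed in frame by a stop codon. It is a simplified sketch under stated assumptions: only the forward strand is scanned, the standard stop codons TAA, TAG, and TGA are used, and the function name `find_orfs` and its `min_codons` parameter are hypothetical.

```python
from typing import List, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 2) -> List[Tuple[int, int, str]]:
    """Return (start, end, orf) for forward-strand ORFs; 0-based, end exclusive.

    An ORF here runs from the first ATG in a frame through the next
    in-frame stop codon (stop codon included in the returned string).
    """
    seq = seq.upper().replace("U", "T")
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, seq[start:i + 3]))
                start = None
    return sorted(orfs)


print(find_orfs("CCATGAAATAGGG"))
```

A production ORF finder would normally also scan the reverse complement and apply a much larger minimum length; both are omitted here to keep the sketch short.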

Figure 9 is a flow diagram illustrating one embodiment of an identifier process 300 for
detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 and then
moves to a state 304 wherein a first sequence that is to be checked for features is stored to a memory
115 in the computer system 100. The process 300 then moves to a state 306 wherein a database of
sequence features is opened. Such a database would include a list of each feature's attributes along
with the name of the feature. For example, a feature name could be "Initiation Codon" and the
attribute would be "ATG". Another example would be the feature name "TAATAA Box" and the
feature attribute would be "TAATAA". An example of such a database is produced by the
University of Wisconsin Genetics Computer Group (www.gcg.com).

Once the database of features is opened at the state 306, the process 300 moves to a state 308
wherein the first feature is read from the database. A comparison of the attribute of the first feature
with the first sequence is then made at a state 310. A determination is then made at a decision state
316 whether the attribute of the feature was found in the first sequence. If the attribute was found,
then the process 300 moves to a state 318 wherein the name of the found feature is displayed to the
user.

The process 300 then moves to a decision state 320 wherein a determination is made whether
more features exist in the database. If no more features do exist, then the process 300 terminates at
an end state 324. However, if more features do exist in the database, then the process 300 reads the
next sequence feature at a state 326 and loops back to the state 310 wherein the attribute of the next
feature is compared against the first sequence.

It should be noted that if the feature attribute is not found in the first sequence at the
decision state 316, the process 300 moves directly to the decision state 320 in order to determine if
any more features exist in the database.
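
The identifier process 300 amounts to looping over a table of named features and searching the sequence for each feature's attribute. The Python sketch below shows one way this could look; the two-entry feature table reuses the examples given in the text, while the function name `identify_features` and the test sequence are assumptions for illustration.

```python
from typing import Dict, List, Tuple

# Feature name -> attribute, using the two examples given in the text.
FEATURES: Dict[str, str] = {
    "Initiation Codon": "ATG",
    "TAATAA Box": "TAATAA",
}


def identify_features(sequence: str,
                      features: Dict[str, str]) -> List[Tuple[str, int]]:
    """Return (feature name, 0-based position of first occurrence) for each feature found."""
    seq = sequence.upper()
    found = []
    for name, attribute in features.items():   # states 308 and 326
        position = seq.find(attribute)          # state 310
        if position != -1:                      # decision state 316
            found.append((name, position))      # state 318
    return found                                # end state 324


found = identify_features("GGTAATAACCATGAAA", FEATURES)
print(found)
```

Exact substring search is the simplest possible comparison; a real feature database would typically describe features as patterns or profiles rather than literal strings.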

In another embodiment, the identifier may comprise a molecular modeling program which
determines the 3-dimensional structure of the polypeptide codes of SEQ ID NOs. 812-1599. In
some embodiments, the molecular modeling program identifies target sequences that are most
compatible with profiles representing the structural environments of the residues in known
three-dimensional protein structures. (See, e.g., Eisenberg et al., U.S. Patent No. 5,436,850 issued July 25,
1995). In another technique, the known three-dimensional structures of proteins in a given family are
superimposed to define the structurally conserved regions in that family. This protein modeling
technique also uses the known three-dimensional structure of a homologous protein to approximate
the structure of the polypeptide codes of SEQ ID NOs. 812-1599. (See, e.g., Srinivasan et al., U.S.
Patent No. 5,557,535 issued September 17, 1996). Conventional homology modeling techniques
have been used routinely to build models of proteases and antibodies. (Sowdhamini et al., Protein
Engineering 10:207, 215 (1997)). Comparative approaches can also be used to develop
three-dimensional protein models when the protein of interest has poor sequence identity to template
proteins. In some cases, proteins fold into similar three-dimensional structures despite having very
weak sequence identities. For example, a number of helical cytokines fold into a similar
three-dimensional topology in spite of weak sequence homology.

The recent development of threading methods now enables the identification of likely folding
patterns in a number of situations where the structural relatedness between target and template(s) is
not detectable at the sequence level. In hybrid methods, fold recognition is performed using
Multiple Sequence Threading (MST), structural equivalencies are deduced from the threading output
using the distance geometry program DRAGON to construct a low resolution model, and a full-atom
representation is constructed using a molecular modeling package such as QUANTA.

According to this 3-step approach, candidate templates are first identified by using the novel
fold recognition algorithm MST, which is capable of performing simultaneous threading of multiple
aligned sequences onto one or more 3-D structures. In a second step, the structural equivalencies
obtained from the MST output are converted into interresidue distance restraints and fed into the
distance geometry program DRAGON, together with auxiliary information obtained from secondary
structure predictions. The program combines the restraints in an unbiased manner and rapidly
generates a large number of low resolution model conformations. In a third step, these low resolution
model conformations are converted into full-atom models and subjected to energy minimization using
the molecular modeling package QUANTA. (See, e.g., Aszodi et al., Proteins: Structure, Function,
and Genetics, Supplement 1:38-42 (1997)).

The results of the molecular modeling analysis may then be used in rational drug design 
techniques to identify agents which modulate the activity of the polypeptide codes of SEQ ID NOS. 
812-1599. 

Accordingly, another aspect of the present invention is a method of identifying a feature
within the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 or the polypeptide codes of
SEQ ID NOs. 812-1599 comprising reading the nucleic acid code(s) or the polypeptide code(s)
through the use of a computer program which identifies features therein and identifying features
within the nucleic acid code(s) or polypeptide code(s) with the computer program. In one
embodiment, the computer program comprises a computer program which identifies open reading
frames. In a further embodiment, the computer program identifies structural motifs in a polypeptide
sequence. In another embodiment, the computer program comprises a molecular modeling program.
The method may be performed by reading a single sequence or at least 2, 5, 10, 15, 20, 25, 30, or 50 of
the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 or the polypeptide codes of SEQ ID
NOs. 812-1599 through the use of the computer program and identifying features within the nucleic
acid codes or polypeptide codes with the computer program.

The nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 or the polypeptide codes of
SEQ ID NOs. 812-1599 may be stored and manipulated in a variety of data processor programs in a
variety of formats. For example, the nucleic acid codes of SEQ ID NOs. 24-811 and 1600-1622 or the
polypeptide codes of SEQ ID NOs. 812-1599 may be stored as text in a word processing file, such as
Microsoft WORD or WORDPERFECT, or as an ASCII file in a variety of database programs familiar to
those of skill in the art, such as DB2, SYBASE, or ORACLE. In addition, many computer programs and
databases may be used as sequence comparers, identifiers, or sources of reference nucleotide or
polypeptide sequences to be compared to the nucleic acid codes of SEQ ID NOs. 24-811 and
1600-1622 or the polypeptide codes of SEQ ID NOs. 812-1599. The following list is intended not to limit
the invention but to provide guidance to programs and databases which are useful with the nucleic acid
codes of SEQ ID NOs. 24-811 and 1600-1622 or the polypeptide codes of SEQ ID NOs. 812-1599.
The programs and databases which may be used include, but are not limited to: MacPattern (EMBL),



WO 99/53051 PCT/IB99/00712


DiscoveryBase (Molecular Applications Group), GeneMine (Molecular Applications Group), Look
(Molecular Applications Group), MacLook (Molecular Applications Group), BLAST and BLAST2
(NCBI), BLASTN and BLASTX (Altschul et al., J. Mol. Biol. 215:403 (1990)), FASTA (Pearson and
Lipman, Proc. Natl. Acad. Sci. USA 85:2444 (1988)), FASTDB (Brutlag et al., Comp. App. Biosci.
6:237-245, 1990), Catalyst (Molecular Simulations Inc.), Catalyst/SHAPE (Molecular Simulations Inc.),
Cerius2.DBAccess (Molecular Simulations Inc.), HypoGen (Molecular Simulations Inc.), Insight II
(Molecular Simulations Inc.), Discover (Molecular Simulations Inc.), CHARMm (Molecular
Simulations Inc.), Felix (Molecular Simulations Inc.), DelPhi (Molecular Simulations Inc.), QuanteMM
(Molecular Simulations Inc.), Homology (Molecular Simulations Inc.), Modeler (Molecular Simulations
Inc.), ISIS (Molecular Simulations Inc.), Quanta/Protein Design (Molecular Simulations Inc.), WebLab
(Molecular Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene Explorer
(Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the EMBL/Swissprotein database,
the MDL Available Chemicals Directory database, the MDL Drug Data Report database, the
Comprehensive Medicinal Chemistry database, Derwent's World Drug Index database, the
BioByteMasterFile database, the Genbank database, and the Genseqn database. Many other programs
and databases would be apparent to one of skill in the art given the present disclosure.

Motifs which may be detected using the above programs include sequences encoding leucine
zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, and beta
sheets, signal sequences encoding signal peptides which direct the secretion of the encoded proteins,
sequences implicated in transcription regulation such as homeoboxes, acidic stretches, enzymatic
active sites, substrate binding sites, and enzymatic cleavage sites.
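
As a small illustration of motif detection, the Python sketch below searches a protein sequence for candidate N-linked glycosylation sites using the commonly cited consensus Asn-X-Ser/Thr, where X is any residue except proline. The consensus pattern is an assumption brought in for the example (the text above does not define any motif), the function name is hypothetical, and the test peptide is made up.

```python
import re
from typing import List, Tuple

# Commonly cited N-glycosylation consensus: N, then any residue except P, then S or T.
# The lookahead lets overlapping candidate sites be reported.
N_GLYC_PATTERN = re.compile(r"(?=(N[^P][ST]))")


def find_n_glycosylation_sites(protein: str) -> List[Tuple[int, str]]:
    """Return (0-based position, matched tripeptide) for each candidate site."""
    protein = protein.upper()
    return [(m.start(), m.group(1)) for m in N_GLYC_PATTERN.finditer(protein)]


sites = find_n_glycosylation_sites("MNGSANPTNKTA")
print(sites)
```

A match only marks a candidate site; whether the site is actually glycosylated depends on cellular context that a sequence pattern cannot capture.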

EXAMPLE 62 
Methods of Making Nucleic Acids 

The present invention also comprises methods of making the EST-related nucleic acids,
fragments of EST-related nucleic acids, positional segments of the EST-related nucleic acids, or
fragments of positional segments of the EST-related nucleic acids. The methods comprise
sequentially linking together nucleotides to produce the nucleic acids having the preceding
sequences. A variety of methods of synthesizing nucleic acids are known to those skilled in the art.

In many of these methods, synthesis is conducted on a solid support. These include the 3'
phosphoramidite methods in which the 3' terminal base of the desired oligonucleotide is immobilized
on an insoluble carrier. The nucleotide base to be added is blocked at the 5' hydroxyl and activated
at the 3' hydroxyl so as to cause coupling with the immobilized nucleotide base. Deblocking of the
new immobilized nucleotide compound and repetition of the cycle will produce the desired
polynucleotide. Alternatively, polynucleotides may be prepared as described in U.S. Patent No.
5,049,656. In some embodiments, several polynucleotides prepared as described above are ligated
together to generate longer polynucleotides having a desired sequence.

EXAMPLE 63
Methods of Making Polypeptides

The present invention also comprises methods of making the polypeptides encoded by
EST-related nucleic acids, fragments of EST-related nucleic acids, positional segments of the EST-related
nucleic acids, or fragments of positional segments of the EST-related nucleic acids and methods of
making the EST-related polypeptides, fragments of EST-related polypeptides, positional segments of
EST-related polypeptides, or fragments of EST-related polypeptides. The methods comprise
sequentially linking together amino acids to produce the polypeptides having the preceding
sequences. In some embodiments, the polypeptides made by these methods are 150 amino acids or
less in length. In other embodiments, the polypeptides made by these methods are 120 amino acids
or less in length.

A variety of methods of making polypeptides are known to those skilled in the art, including
methods in which the carboxyl terminal amino acid is bound to polyvinyl benzene or another suitable
resin. The amino acid to be added possesses blocking groups on its amino moiety and any side chain
reactive groups so that only its carboxyl moiety can react. The carboxyl group is activated with
carbodiimide or another activating agent and allowed to couple to the immobilized amino acid. After
removal of the blocking group, the cycle is repeated to generate a polypeptide having the desired
sequence. Alternatively, the methods described in U.S. Patent No. 5,049,656 may be used.

As discussed above, the EST-related nucleic acids, fragments of the EST-related nucleic
acids, positional segments of the EST-related nucleic acids, or fragments of positional segments of
the EST-related nucleic acids can be used for various purposes. The polynucleotides can be used to
express recombinant protein for analysis, characterization or therapeutic use; for production of secreted
polypeptides or chimeric polypeptides; for antibody production; as markers for tissues in which the
corresponding protein is preferentially expressed (either constitutively or at a particular stage of tissue
differentiation or development or in disease states); as molecular weight markers on Southern gels; as
chromosome markers or tags (when labeled) to identify chromosomes or to map related gene positions;
to compare with endogenous DNA sequences in patients to identify potential genetic disorders; as probes
to hybridize and thus discover novel, related DNA sequences; as a source of information to derive PCR
primers for genetic fingerprinting; for selecting and making oligomers for attachment to a "gene chip" or
other support, including for examination for expression patterns; to raise anti-protein antibodies using
DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another immune
response. Where the polynucleotide encodes a protein or polypeptide which binds or potentially binds to
another protein or polypeptide (such as, for example, in a receptor-ligand interaction), the polynucleotide
can also be used in interaction trap assays (such as, for example, that described in Gyuris et al., Cell
75:791-803 (1993)) to identify polynucleotides encoding the other protein or polypeptide with which
binding occurs or to identify inhibitors of the binding interaction.

The proteins or polypeptides provided by the present invention can similarly be used in assays to
determine biological activity, including in a panel of multiple proteins for high-throughput screening; to
raise antibodies or to elicit another immune response; as a reagent (including the labeled reagent) in
assays designed to quantitatively determine levels of the protein (or its receptor) in biological fluids; as
markers for tissues in which the corresponding protein is preferentially expressed (either constitutively or
at a particular stage of tissue differentiation or development or in a disease state); and, of course, to
isolate correlative receptors or ligands. Where the protein or polypeptide binds or potentially binds to
another protein or polypeptide (such as, for example, in a receptor-ligand interaction), the protein can be
used to identify the other protein with which binding occurs or to identify inhibitors of the binding
interaction. Proteins or polypeptides involved in these binding interactions can also be used to screen for
peptide or small molecule inhibitors or agonists of the binding interaction.

Any or all of these research utilities are capable of being developed into reagent grade or kit 
format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art.
References disclosing such methods include without limitation "Molecular Cloning: A Laboratory
Manual," 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E.F. Fritsch and T. Maniatis eds.,
1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques," Academic Press,
Berger, S.L. and A.R. Kimmel eds., 1987.

Polynucleotides and proteins or polypeptides of the present invention can also be used as
nutritional sources or supplements. Such uses include without limitation use as a protein or amino acid
supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In
such cases the protein or polynucleotide of the invention can be added to the feed of a particular
organism or can be administered as a separate solid or liquid preparation, such as in the form of powder,
pills, solutions, suspensions or capsules. In the case of microorganisms, the protein or polynucleotide of
the invention can be added to the medium in or on which the microorganism is cultured.

Although this invention has been described in terms of certain preferred embodiments, other 
embodiments which will be apparent to those of ordinary skill in the art in view of the disclosure herein 
are also within the scope of this invention. Accordingly, the scope of the invention is intended to be 
limited only by reference to the appended claims. 

Sequence Listing Free Text

The following free text appears in the accompanying Sequence Listing:
Von Heijne matrix
score
sequence
name
matinspector prediction

1. A purified nucleic acid comprising a sequence selected from the group consisting of SEQ
ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and sequences complementary to the sequences of
SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622.

2. A purified nucleic acid comprising at least 15 consecutive nucleotides of a sequence
selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and
sequences complementary to the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622.

3. A purified or isolated polypeptide comprising a sequence selected from the group
consisting of the sequences of SEQ ID NOs. 812-1599.

4. A method of making a cDNA comprising the steps of:

a) contacting a collection of mRNA molecules from human cells with a primer
comprising at least 15 consecutive nucleotides of a sequence selected from the group consisting of
the sequences complementary to SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622;

b) hybridizing said primer to an mRNA in said collection that encodes said protein;

c) reverse transcribing said hybridized primer to make a first cDNA strand from said
mRNA;

d) making a second cDNA strand complementary to said first cDNA strand; and

e) isolating the resulting cDNA comprising said first cDNA strand and said second
cDNA strand.

5. A method of making a cDNA comprising the steps of:

a) obtaining a cDNA comprising a sequence selected from the group consisting of SEQ
ID NOs. 24-811 and SEQ ID NOs. 1600-1622;

b) contacting said cDNA with a detectable probe comprising at least 15 consecutive
nucleotides of a sequence selected from the group consisting of SEQ ID NOs. 24-811 and SEQ ID
NOs. 1600-1622 and the sequences complementary to SEQ ID NOs. 24-811 and SEQ ID NOs.
1600-1622 under conditions which permit said probe to hybridize to said cDNA;

c) identifying a cDNA which hybridizes to said detectable probe; and

d) isolating said cDNA which hybridizes to said probe.

6. A method of making a cDNA comprising the steps of:

a) contacting a collection of mRNA molecules from human cells with a first primer
capable of hybridizing to the polyA tail of said mRNA;

b) hybridizing said first primer to said polyA tail;

c) reverse transcribing said mRNA to make a first cDNA strand;

d) making a second cDNA strand complementary to said first cDNA strand using at least
one primer comprising at least 15 consecutive nucleotides of a sequence selected from the group
consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622; and

e) isolating the resulting cDNA comprising said first cDNA strand and said second
cDNA strand.

7. A method of making a polypeptide comprising the steps of:

a) obtaining a cDNA which encodes a polypeptide encoded by a nucleic acid comprising
a sequence selected from the group consisting of SEQ ID NOs. 24-811 or a cDNA which encodes a
polypeptide comprising at least 10 consecutive amino acids of a polypeptide encoded by a sequence
selected from the group consisting of SEQ ID NOs. 24-811;

b) inserting said cDNA in an expression vector such that said cDNA is operably linked to
a promoter;

c) introducing said expression vector into a host cell whereby said host cell produces the
protein encoded by said cDNA; and

d) isolating said protein.

8. In an array of discrete ESTs or fragments thereof of at least 15 nucleotides in length, the
improvement comprising inclusion in said array of at least one sequence selected from the group
consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622, the sequences complementary to
the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and fragments comprising at
least 15 consecutive nucleotides of said sequence.

9. The array of Claim 8 including therein at least five sequences selected from the group
consisting of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622, the sequences complementary to
the sequences of SEQ ID NOs. 24-811 and SEQ ID NOs. 1600-1622 and fragments comprising at
least 15 consecutive nucleotides of said sequences.

10. An enriched population of recombinant nucleic acids, said recombinant nucleic acids
comprising an insert nucleic acid and a backbone nucleic acid; wherein at least 5% of said insert
nucleic acids in said population comprise a sequence selected from the group consisting of SEQ ID
NOs. 24-811 and SEQ ID NOs. 1600-1622, the sequences complementary to SEQ ID NOs. 24-811
and SEQ ID NOs. 1600-1622 and fragments comprising at least 15 consecutive nucleotides of said
sequences.

11. An antibody composition capable of selectively binding to an epitope-containing
fragment of a polypeptide comprising a contiguous span of at least 8 amino acids of any of SEQ ID
NOs. 812-1599, wherein said antibody is polyclonal or monoclonal.

12. A computer readable medium having stored thereon a sequence selected from the group
consisting of a nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622 and a polypeptide code of
SEQ ID NOs. 812-1599.

13. A computer system comprising a processor and a data storage device wherein said data 
storage device has stored thereon a sequence selected from the group consisting of a nucleic acid 
code of SEQ ID NOs. 24-811 and 1600-1622 and a polypeptide code of SEQ ID NOs. 812-1599. 

14. The computer system of Claim 13 further comprising a sequence comparer and a data 
storage device having reference sequences stored thereon. 

15. The computer system of Claim 14 wherein said sequence comparer comprises a 
computer program which indicates polymorphisms. 

16. The computer system of Claim 13 further comprising an identifier which identifies 
features in said sequence. 

17. A method for comparing a first sequence to a reference sequence wherein said first 
sequence is selected from the group consisting of a nucleic acid code of SEQ ID NOs. 24-811 and 
1600-1622 and a polypeptide code of SEQ ID NOs. 812-1599 comprising the steps of: 

a) reading said first sequence and said reference sequence through use of a computer 
program which compares sequences; and 

b) determining differences between said first sequence and said reference sequence with 
said computer program. 

18. The method of Claim 17, wherein said step of determining differences between the first 
sequence and the reference sequence comprises identifying polymorphisms. 
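
The comparison recited in Claims 17 and 18 can be illustrated with a short Python sketch. It assumes the two sequences are already aligned and of equal length, so each mismatching position is reported as a candidate single-position polymorphism. The function name and the return format are hypothetical choices for illustration, not the program the claims refer to.

```python
def find_differences(first, reference):
    """Compare two pre-aligned sequences of equal length.

    Returns a list of (position, first_symbol, reference_symbol) tuples,
    with 1-based positions, for every position where the symbols differ.
    """
    if len(first) != len(reference):
        raise ValueError("this sketch only handles sequences of equal length")
    first, reference = first.lower(), reference.lower()
    return [
        (position, a, b)
        for position, (a, b) in enumerate(zip(first, reference), start=1)
        if a != b
    ]
```

A real comparer would first align the sequences so that insertions and deletions are handled; that step is left out here to keep the example small.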

19. A method for identifying a feature in a sequence selected from the group consisting of a 
nucleic acid code of SEQ ID NOs. 24-811 and 1600-1622 and a polypeptide code of SEQ ID NOs. 
812-1599 comprising the steps of: 

a) reading said sequence through the use of a computer program which identifies features in 
sequences; and 

b) identifying features in said sequence with said computer program. 
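
As one possible illustration of Claim 19, the Python sketch below reads a nucleotide sequence and reports open reading frames on the forward strand as features. The choice of open reading frames as the feature, the function name, and the `min_codons` parameter are all assumptions made for this example; the claim itself does not name a particular feature or program.

```python
STOP_CODONS = {"taa", "tag", "tga"}


def find_orfs(seq, min_codons=10):
    """Return (start, end) pairs, 1-based and inclusive, for forward-strand
    open reading frames that run from an atg to the first in-frame stop codon.
    The reported end includes the stop codon; min_codons counts codons
    before the stop."""
    seq = seq.lower()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "atg":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start + 1, i + 3))
                start = None
    return sorted(orfs)
```

The same loop could be extended to the reverse strand or to other features such as polyadenylation signals.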

20. A vector comprising a nucleic acid according to either Claims 1 or 2. 

21. A host cell containing a nucleic acid of Claim 20. 

[Schematic. Legible labels: Cap; mRNA; A...A 3'; steps 1), 2) and 3); amine-containing tag; RT tag; 
cDNA 1st strand; Primer 1; cDNA 2nd strand.] 

Figure 1 

Minimum signal peptide score   False positive rate   False negative rate   proba(0.1)   proba(0.2) 
3.5                            0.121                 0.036                 0.467        0.664 
4                              0.096                 0.06                  0.519        0.708 
4.5                            0.078                 0.079                 0.565        0.745 
5                              0.062                 0.098                 0.615        0.782 
5.5                            0.05                  0.127                 0.659        0.813 
6                              0.04                  0.163                 0.694        0.836 
6.5                            0.033                 0.202                 0.725        0.855 
7                              0.025                 0.248                 0.763        0.878 
7.5                            0.021                 0.304                 0.78         0.889 
8                              0.015                 0.368                 0.816        0.909 
8.5                            0.012                 0.418                 0.836        0.92 
9                              0.009                 0.512                 0.856        0.93 
9.5                            0.007                 0.581                 0.863        0.934 
10                             0.006                 0.679                 0.835        0.919 

Figure 2 
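
The trade-off tabulated in Figure 2 can be used to choose a score cutoff. The Python sketch below transcribes the score, false positive rate and false negative rate columns and returns the lowest tabulated score whose false positive rate does not exceed a chosen limit. The function name and the selection rule are illustrative assumptions, not a procedure stated in this document.

```python
# (minimum signal peptide score, false positive rate, false negative rate),
# transcribed from Figure 2.
FIGURE_2 = [
    (3.5, 0.121, 0.036), (4.0, 0.096, 0.06), (4.5, 0.078, 0.079),
    (5.0, 0.062, 0.098), (5.5, 0.05, 0.127), (6.0, 0.04, 0.163),
    (6.5, 0.033, 0.202), (7.0, 0.025, 0.248), (7.5, 0.021, 0.304),
    (8.0, 0.015, 0.368), (8.5, 0.012, 0.418), (9.0, 0.009, 0.512),
    (9.5, 0.007, 0.581), (10.0, 0.006, 0.679),
]


def lowest_score_meeting(max_false_positive_rate):
    """Return the first (lowest) tabulated row whose false positive rate is
    at or below the limit, or None if no tabulated score is strict enough."""
    for score, false_positive, false_negative in FIGURE_2:
        if false_positive <= max_false_positive_rate:
            return score, false_positive, false_negative
    return None
```

Picking the lowest qualifying score keeps the false negative rate as small as the table allows for the chosen false positive limit, since the false negative rate rises with the score.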
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1 nested primers 
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-TTT 



PCR 

Figure 3 

Promoter sequence P13H2 (546 bp): 

Matrix            Position  Orientation  Score   Length  Sequence 
CMYB_01           -502      +            0.983   9       TGTCAGTTG 
MYOD_Q6           -501      -            0.961   10      CCCAACTGAC 
S8_01             -444      -            0.960   11      AATAGAATTAG 
S8_01             -425      +            0.966   11      AACTAAATTAG 
DELTAEF1_01       -390      -            0.960   11      GCAGACCTCAG 
GATA_C            -364      -            0.964   11      AGATAAATCCA 
CMYB_01           -349      +            0.958   9       CTTCAGTTG 
GATA1_02          -343      +            0.959   14      TTGTAGATAGGACA 
GATA_C            -339      +            0.953   11      AGATAGGACAT 
TAL1ALPHAE47_01   -235      +            0.973   16      CATAACAGATGGTAAG 
TAL1BETAE47_01    -235      +            0.983   16      CATAACAGATGGTAAG 
TAL1BETAITF2_01   -235      +            0.978   16      CATAACAGATGGTAAG 
MYOD_Q6           -232      -            0.954   10      ACCATCTGTT 
GATA1_04          -217      -            0.953   13      TCAAGATAAAGTA 
IK1_01            -126      +            0.963   13      AGTTGGGAATTCC 
IK2_01            -126      +            0.985   12      AGTTGGGAATTC 
CREL_01           -123      +            0.962   10      TGGGAATTCC 
GATA1_02          -96       +            0.950   14      TCAGTGATATGGCA 
SRY_02            -41       -            0.951   12      TAAAACAAAACA 
E2F_02            -33       +            0.957   8       TTTAGCGC 
MZF1_01           -5        -            0.975   8       TGAGGGGA 

Promoter sequence P15B4 (BBIbp): 

Matrix            Position  Orientation  Score   Length  Sequence 
NFY_Q6            -748      -            0.956   11      GGACCAATCAT 
MZF1_01           -738      +            0.962   8       CCTGGGGA 
CMYB_01           -684      +            0.994   9       TGACCGTTG 
VMYB_02           -682      -            0.985   9       TCCAACGGT 
STAT_01           -673      +            0.968   9       TTCCTGGAA 
STAT_01           -673      -            0.951   9       TTCCAGGAA 
MZF1_01           -556      -            0.956   8       TTGGGGGA 
IK2_01            -451                   0.965   12      GAATGGGATTTC 
MZF1_01           -424                   0.986   8       AGAGGGGA 
SRY_02            -398                   0.955   12      GAAAACAAAACA 
MZF1_01           -216                   0.960   8       GAAGGGGA 
MYOD_Q6           -190                   0.981   10      AGCATCTGCC 
DELTAEF1_01       -176                   0.958   11      TCCCACCTTCC 
S8_01             5                      0.992   11      GAGGCAATTAT 
MZF1_01           16                     0.986   8       AGAGGGGA 

Promoter sequence P29B6 (555 bp): 

Matrix            Position  Orientation  Score   Length  Sequence 
ARNT_01           -311      +            0.964   16      GGACTCACGTGCTGCT 
NMYC_01           -309      +            0.965   12      ACTCACGTGCTG 
USF_01            -309      +            0.985   12      ACTCACGTGCTG 
USF_01            -309      -            0.985   12      CAGCACGTGAGT 
NMYC_01           -309      -            0.956   12      CAGCACGTGAGT 
MYCMAX_02         -309      -            0.972   12      CAGCACGTGAGT 
USF_C             -307      +            0.997   8       TCACGTGC 
USF_C             -307      -            0.991   8       GCACGTGA 
MZF1_01           -292      -            0.968   8       CATGGGGA 
ELK1_02           -105      +            0.963   14      CTCTCCGGAAGCCT 
CETS1P54_01       -102      +            0.974   10      TCCGGAAGCC 
AP1_Q4            -42       -            0.963   11      AGTGACTGAAC 
AP1FJ_Q2          -42       -            0.961   11      AGTGACTGAAC 
PADS_C            45        +            1.000   9       TGTGGTCTC 

Figure 5 



WO 99/53051 PCT/IB99/00712 

[Block diagram. Legible labels: 118; 120; DATA RETRIEVING DEVICE; DISPLAY.] 

FIGURE 6 

[Flowchart, sheet 7/10. Reference numerals: 200, 201, 202, 204, 206, 210, 212, 214, 224. 
Boxes, in order: START; STORE NEW SEQUENCE TO A MEMORY; OPEN DATABASE OF SEQUENCES; 
READ FIRST SEQUENCE IN DATABASE; PERFORM COMPARISON OF NEW SEQUENCE AND STORED SEQUENCE; 
a decision with YES and NO branches; DISPLAY STORED SEQUENCE NAME TO USER; 
GO TO NEXT SEQUENCE IN DATABASE.] 
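
The loop drawn in this flowchart (store the new sequence, open the database, compare against each stored sequence in turn, and report the names of those that pass the decision) can be sketched in Python as follows. The decision criterion is not legible in the drawing, so this sketch assumes a simple position-by-position identity fraction with a caller-chosen threshold. The function names and the `(name, sequence)` database layout are likewise hypothetical.

```python
def identity(a, b):
    """Fraction of positions with the same symbol, counted over the longer
    sequence; no alignment is performed (an assumption of this sketch)."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    same = sum(1 for x, y in zip(a, b) if x == y)
    return same / longest


def search_database(new_sequence, database, min_identity=1.0):
    """Walk the database of (name, sequence) pairs in order and collect the
    names of stored sequences that pass the comparison."""
    matches = []
    for name, stored_sequence in database:      # read first, then go to next
        if identity(new_sequence, stored_sequence) >= min_identity:
            matches.append(name)                # name to display to the user
    return matches
```

With `min_identity=1.0` only exact matches are reported; lowering the threshold reports near matches as well.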
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FIGURE 8 
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SEQUENCE LISTING 

<110> Dumas Milne Edwards, J.B. 
      Duclert, A. 
      Giordano, J.Y. 
      Genset SA 

<120> ESTs and Encoded Human Proteins. 

<130> D18118-339 881 

<150> 09/057,719 
<151> 1998-04-09 

<150> 09/069,047 
<151> 1998-04-28 

<160> 1622 

<170> Patent.pm 

<210> 1 
<211> 822 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 346..552 

<221> sig_peptide 
<222> 346..408 
<223> Von Heijne matrix 

<221> misc_feature 
<222> 115 
<223> n=a, g, c or t 

<400> 1 
actcctttta gcataggggc ttcggcgcca gcggccagcg ctagtcggtc tggtaagtgc 60 
ctgatgccga gttccgtctc tcgcgtcttt tcctggtccc aggcaaagcg gasgnagatc 120 
ctcaaacggc ctagtgcttc gcgcttccgg agaaaatcag cggtctaatt aattcctctg 180 
gtttgttgaa gcagttacca agaatcttca accctttccc acaaaagcta attgagtaca 240 
cgttcctgtt gagtacacgt tcctgttgat ttacaaaagg tgcaggtatg agcaggtctg 300 
aagactaaca ttttgtgaag ttgtaaaaca gaaaacctgt tagaa atg tgg tgg ttt 357 
                                                  Met Trp Trp Phe 
                                                      -20 
cag caa ggc ctc agt ttc ctt cct tca gcc ctt gta att tgg aca tct 405 
Gln Gln Gly Leu Ser Phe Leu Pro Ser Ala Leu Val Ile Trp Thr Ser 
        -15                 -10                 -5 
gct gct ttc ata ttt tca tac att act gca gta aca ctc cac cat ata 453 
Ala Ala Phe Ile Phe Ser Tyr Ile Thr Ala Val Thr Leu His His Ile 
    1               5                   10                  15 
gac ccg gct tta cct tat atc agt gac act ggt aca gta gct cca raa 501 
Asp Pro Ala Leu Pro Tyr Ile Ser Asp Thr Gly Thr Val Ala Pro Xaa 
                20                  25                  30 
aaa tgc tta ttt ggg gca atg cta aat att gcg gca gtt tta tgt caa 549 
Lys Cys Leu Phe Gly Ala Met Leu Asn Ile Ala Ala Val Leu Cys Gln 
            35                  40                  45 
aaa tagaaatcag gaarataatt caacttaaag aakttcattt catgaccaaa 602 
Lys 
ctcttcaraa acatgtcttt acaagcatat ctcttgtatt gctttctaca ctgttgaatt 662 
gtctggcaat atttctgcag tggaaaattt gatttarmta gttcttgact gataaatatg 722 
gtaaggtggg cttttccccc tgtgtaattg gctactatgt cttactgagc caagttgtaw 782 
tttgaaataa aatgatatga gagtgacaca aaaaaaaaaa 822 

<210> 2 
<211> 21 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SIGNAL 
<222> 1..21 
<223> Von Heijne matrix 
      score 5.5 
      seq SFLPSALVIWTSA/AF 

<400> 2 
Met Trp Trp Phe Gln Gln Gly Leu Ser Phe Leu Pro Ser Ala Leu Val 
1               5                   10                  15 
Ile Trp Thr Ser Ala 
            20 

<210> 3 
<211> 526 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 90..344 

<221> sig_peptide 
<222> 90..140 
<223> Von Heijne matrix 

<221> misc_feature 
<222> 290 
<223> n=a, g, c or t 

<400> 3 
aatatrarac agctacaata ttccagggcc artcacttgc catttctcat aacagcgtca 60 
gagagaaaga actgactgar acgtttgag atg aag aaa gtt ctc ctc ctg atc 113 
                                Met Lys Lys Val Leu Leu Leu Ile 
                                        -15                 -10 
aca gcc atc ttg gca gtg gct gtw ggt ttc cca gtc tct caa gac cag 161 
Thr Ala Ile Leu Ala Val Ala Val Gly Phe Pro Val Ser Gln Asp Gln 
                -5                  1               5 
gaa cga gaa aaa aga agt atc agt gac agc gat gaa tta gct tca ggr 209 
Glu Arg Glu Lys Arg Ser Ile Ser Asp Ser Asp Glu Leu Ala Ser Gly 
        10                  15                  20 
wtt ttt gtg ttc cct tac cca tat cca ttt cgc cca ctt cca cca att 257 
Xaa Phe Val Phe Pro Tyr Pro Tyr Pro Phe Arg Pro Leu Pro Pro Ile 
    25                  30                  35 
cca ttt cca aga ttt cca tgg ttt aga cgt aan ttt cct att cca ata 305 
Pro Phe Pro Arg Phe Pro Trp Phe Arg Arg Xaa Phe Pro Ile Pro Ile 
40                  45                  50                  55 
cct gaa tct gcc cct aca act ccc ctt cct agc gaa aag taaacaaraa 354 
Pro Glu Ser Ala Pro Thr Thr Pro Leu Pro Ser Glu Lys 
                60                  65 
ggaaaagtca crataaacct ggtcacctga aattgaaatt gagccacttc cttgaaraat 414 
caaaattcct gttaataaaa raaaaacaaa tgtaattgaa atagcacaca gcattctcta 474 
gtcaatatct ttagtgatct tctttaataa acatgaaagc aaaaaaaaaa aa 526 

<210> 4 
<211> 17 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SIGNAL 
<222> 1..17 
<223> Von Heijne matrix 
      score 8.2 
      seq LLLITAILAVAVG/FP 

<400> 4 
Met Lys Lys Val Leu Leu Leu Ile Thr Ala Ile Leu Ala Val Ala Val 
1               5                   10                  15 
Gly 

<210> 5 
<211> 848 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32..697 

<221> sig_peptide 
<222> 32..73 
<223> Von Heijne matrix 

<400> 5 
aactttgcct tgtgttttcc accctgaaag a atg ttg tgg ctg ctc ttt ttt 52 
                                   Met Leu Trp Leu Leu Phe Phe 
                                                   -10 
ctg gtg act gcc att cat gct gaa ctc tgt caa cca ggt gca gaa aat 100 
Leu Val Thr Ala Ile His Ala Glu Leu Cys Gln Pro Gly Ala Glu Asn 
        -5                  1               5 
gct ttt aaa gtg aga ctt agt atc aga aca gct ctg gga gat aaa gca 148 
Ala Phe Lys Val Arg Leu Ser Ile Arg Thr Ala Leu Gly Asp Lys Ala 
10                  15                  20                  25 
tat gcc tgg gat acc aat gaa gaa tac ctc ttc aaa gcg atg gta gct 196 
Tyr Ala Trp Asp Thr Asn Glu Glu Tyr Leu Phe Lys Ala Met Val Ala 
                30                  35                  40 
ttc tcc atg aga aaa gtt ccc aac aga gaa gca aca gaa att tcc cat 244 
Phe Ser Met Arg Lys Val Pro Asn Arg Glu Ala Thr Glu Ile Ser His 
            45                  50                  55 
gtc cta ctt tgc aat gta acc cag agg gta tca ttc tgg ttt gtg gtt 292 
Val Leu Leu Cys Asn Val Thr Gln Arg Val Ser Phe Trp Phe Val Val 
        60                  65                  70 
aca gac cct tca aaa aat cac acc ctt cct gct gtt gag gtg caa tca 340 
Thr Asp Pro Ser Lys Asn His Thr Leu Pro Ala Val Glu Val Gln Ser 
    75                  80                  85 
gcc ata aga atg aac aag aac cgg atc aac aat gcc ttc ttt cta aat 388 
Ala Ile Arg Met Asn Lys Asn Arg Ile Asn Asn Ala Phe Phe Leu Asn 
90                  95                  100                 105 
gac caa act ctg gaa ttt tta aaa atc cct tcc aca ctt gca cca ccc 436 
Asp Gln Thr Leu Glu Phe Leu Lys Ile Pro Ser Thr Leu Ala Pro Pro 
                110                 115                 120 
atg gac cca tct gtg ccc atc tgg att att ata ttt ggt gtg ata ttt 484 
Met Asp Pro Ser Val Pro Ile Trp Ile Ile Ile Phe Gly Val Ile Phe 
            125                 130                 135 
tgc atc atc ata gtt gca att gca cta ctg att tta tca ggg atc tgg 532 
Cys Ile Ile Ile Val Ala Ile Ala Leu Leu Ile Leu Ser Gly Ile Trp 
        140                 145                 150 
caa cat ada ara aag aac aaa gaa cca tct gaa gtg gat gac gct gaa 580 
Gln Arg Xaa Xaa Lys Asn Lys Glu Pro Ser Glu Val Asp Asp Ala Glu 
    155                 160                 165 
rat aak tgt gaa aac atg atc aca att gaa aat ggc atc ccc tct gat 628 
Xaa Xaa Cys Glu Asn Met Ile Thr Ile Glu Asn Gly Ile Pro Ser Asp 
170                 175                 180                 185 
ccc ctg gac atg aag gga ggg cat att aat gat gcc ttc atg aca gag 676 
Pro Leu Asp Met Lys Gly Gly His Ile Asn Asp Ala Phe Met Thr Glu 
                190                 195                 200 
gat gag agg ctc acc cct ctc tgaagggctg ttgttctgct tcctcaaraa 727 
Asp Glu Arg Leu Thr Pro Leu 
            205 
attaaacatt tgtttctgtg tgactgctga gcatcctgaa ataccaagag cagatcatat 787 
wttttgtttc accattcttc ttttgtaata aattttgaat gtgcttgaaa aaaaaaaaaa 847 
c 848 

<210> 6 
<211> 14 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SIGNAL 
<222> 1..14 
<223> Von Heijne matrix 
      score 10.7 
      seq LWLLFFLVTAIHA/EL 

<400> 6 
Met Leu Trp Leu Leu Phe Phe Leu Val Thr Ala Ile His Ala 
1               5                   10 
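
The CDS and sig_peptide coordinates in each DNA entry are enough to recover the signal peptide and the start of the mature protein by translation. The Python sketch below uses the standard genetic code and, as a worked case, the first 100 nucleotides of SEQ ID NO: 5 as transcribed in this listing. The function names are hypothetical, codons containing ambiguity codes are reported as X (an assumption of this sketch), and the CDS end is truncated to position 100 only because the example uses a fragment.

```python
# Standard genetic code, built in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons; unknown or ambiguous codons become 'X'."""
    dna = dna.lower()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def split_signal_peptide(sequence, cds, sig_peptide):
    """cds and sig_peptide are (start, end) pairs, 1-based and inclusive,
    as given in the <222> lines. Returns (signal, mature) in one-letter code."""
    _, cds_end = cds
    sig_start, sig_end = sig_peptide
    signal = translate(sequence[sig_start - 1:sig_end])
    mature = translate(sequence[sig_end:cds_end])
    return signal, mature


# First 100 nucleotides of SEQ ID NO: 5 (5' untranslated region, then codons).
SEQ5_FIRST_100 = (
    "aactttgccttgtgttttccaccctgaaaga"
    "atgttgtggctgctctttttt"
    "ctggtgactgccattcatgctgaactctgtcaaccaggtgcagaaaat"
)

signal, mature = split_signal_peptide(SEQ5_FIRST_100, cds=(32, 100), sig_peptide=(32, 73))
```

Here `signal` corresponds to the 14-residue entry of SEQ ID NO: 6, and `mature` begins with the residues that follow the cleavage site marked by the slash in the `seq` annotation.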



<210> 7 
<211> 826 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 15..695 

<221> sig_peptide 
<222> 15..80 
<223> score 8.5 
      seq AALLLGLMMVVTG/DE 

<400> 7 
aaccagaggt gccc atg ggt tgg aca atg agg ctg gtc aca gca gca ctg 50 
                Met Gly Trp Thr Met Arg Leu Val Thr Ala Ala Leu 
                        -20                 -15 
tta ctg ggt ctc atg atg gtg gtc act gga gac gag gat gag aac agc 98 
Leu Leu Gly Leu Met Met Val Val Thr Gly Asp Glu Asp Glu Asn Ser 
-10                 -5                  1               5 
ccg tgt gcc cat gag gcc ctc ctg gac gag gac acc ctc ttt tgc cag 146 
Pro Cys Ala His Glu Ala Leu Leu Asp Glu Asp Thr Leu Phe Cys Gln 
            10                  15                  20 
ggc ctt gaa gtt ttc tac cca gag ttg ggg aac att ggc tgc aag gtt 194 
Gly Leu Glu Val Phe Tyr Pro Glu Leu Gly Asn Ile Gly Cys Lys Val 
        25                  30                  35 
gtt cct gat tgt aac aac tac aga cag aag atc acc tcc tgg atg gag 242 
Val Pro Asp Cys Asn Asn Tyr Arg Gln Lys Ile Thr Ser Trp Met Glu 
    40                  45                  50 
ccg ata gtc aag ttc ccg ggg gcc gtg gac ggc gca acc tat atc ctg 290 
Pro Ile Val Lys Phe Pro Gly Ala Val Asp Gly Ala Thr Tyr Ile Leu 
55                  60                  65                  70 
gtg atg gtg gat cca gat gcc cct agc aga gca gaa ccc aga cag aga 338 
Val Met Val Asp Pro Asp Ala Pro Ser Arg Ala Glu Pro Arg Gln Arg 
                75                  80                  85 
ttc tgg aga cat tgg ctg gta aca gat atc aag ggc gcc gac ctg aag 386 
Phe Trp Arg His Trp Leu Val Thr Asp Ile Lys Gly Ala Asp Leu Lys 
            90                  95                  100 
aaa ggg aag att cag ggc cag gag tta tca gcc tac cag gct ccc tcc 434 
Lys Gly Lys Ile Gln Gly Gln Glu Leu Ser Ala Tyr Gln Ala Pro Ser 
        105                 110                 115 
cca ccg gca cac agt ggc ttc cat cgc tac cag ttc ttt gtc tat ctt 482 
Pro Pro Ala His Ser Gly Phe His Arg Tyr Gln Phe Phe Val Tyr Leu 
    120                 125                 130 
cag gaa gga aag gtc atc tct ctc ctt ccc aag gaa aac aaa act cga 530 
Gln Glu Gly Lys Val Ile Ser Leu Leu Pro Lys Glu Asn Lys Thr Arg 
135                 140                 145                 150 
ggc tct tgg aaa atg gac aga ttt ctg aac cgt ttc cac ctg ggc gaa 578 
Gly Ser Trp Lys Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu 
                155                 160                 165 
cct gaa gca agc acc cag ttc atg acc cag aac tac cag gac tca cca 626 
Pro Glu Ala Ser Thr Gln Phe Met Thr Gln Asn Tyr Gln Asp Ser Pro 
            170                 175                 180 
acc ctc cag gct ccc aga gaa agg gcc agc gag ccc aag cac aaa aac 674 
Thr Leu Gln Ala Pro Arg Glu Arg Ala Ser Glu Pro Lys His Lys Asn 
        185                 190                 195 
cag gcg gag ata gct gcc tgc tagatagccg gctttgccat ccgggcatgt 725 
Gln Ala Glu Ile Ala Ala Cys 
    200                 205 
ggccacactg cccaccaccg acgatgtggg tatggaaccc cctctggata cagaacccct 785 
tcttttccaa ataaaaaaaa aatcatccaa aaaaaaaaaa a 826 



<210> 8 
<211> 227 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SIGNAL 
<222> -22..-1 
<223> score 8.5 
      seq AALLLGLMMVVTG/DE 

<400> 8 
Met Gly Trp Thr Met Arg Leu Val Thr Ala Ala Leu Leu Leu Gly Leu 
        -20                 -15                 -10 
Met Met Val Val Thr Gly Asp Glu Asp Glu Asn Ser Pro Cys Ala His 
    -5                  1               5                   10 
Glu Ala Leu Leu Asp Glu Asp Thr Leu Phe Cys Gln Gly Leu Glu Val 
                15                  20                  25 
Phe Tyr Pro Glu Leu Gly Asn Ile Gly Cys Lys Val Val Pro Asp Cys 
            30                  35                  40 
Asn Asn Tyr Arg Gln Lys Ile Thr Ser Trp Met Glu Pro Ile Val Lys 
        45                  50                  55 
Phe Pro Gly Ala Val Asp Gly Ala Thr Tyr Ile Leu Val Met Val Asp 
    60                  65                  70 
Pro Asp Ala Pro Ser Arg Ala Glu Pro Arg Gln Arg Phe Trp Arg His 
75                  80                  85                  90 
Trp Leu Val Thr Asp Ile Lys Gly Ala Asp Leu Lys Lys Gly Lys Ile 
                95                  100                 105 
Gln Gly Gln Glu Leu Ser Ala Tyr Gln Ala Pro Ser Pro Pro Ala His 
            110                 115                 120 
Ser Gly Phe His Arg Tyr Gln Phe Phe Val Tyr Leu Gln Glu Gly Lys 
        125                 130                 135 
Val Ile Ser Leu Leu Pro Lys Glu Asn Lys Thr Arg Gly Ser Trp Lys 
    140                 145                 150 
Met Asp Arg Phe Leu Asn Arg Phe His Leu Gly Glu Pro Glu Ala Ser 
155                 160                 165                 170 
Thr Gln Phe Met Thr Gln Asn Tyr Gln Asp Ser Pro Thr Leu Gln Ala 
                175                 180                 185 
Pro Arg Glu Arg Ala Ser Glu Pro Lys His Lys Asn Gln Ala Glu Ile 
            190                 195                 200 
Ala Ala Cys 
        205 



<210> 9 
<211> 852 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 229..735 

<221> sig_peptide 
<222> 229..492 
<223> score 6.7 
      seq VFALSSFLNKASA/VY 

<400> 9 
aatgactggc agtggcatca gcgatggcgg ctgcgtcggg gtcggttctg cagcgctgta 60 
tcgtgtcgcc ggcagggagg catagcgcct ctctgatctt cctgcatggc tcaggtgatt 120 
ctggacaagg attaagaatg tggatcaagc aggtttttaa atcaagattt aacattccaa 180 
cacataaaaa ttatttatcc aacagctcct cccagatcat atactcct atg aaa gga 237 
                                                     Met Lys Gly 
gga atc tcc aat gta tgg ttt gac aga ttt aaa ata acc aat gac tgc 285 
Gly Ile Ser Asn Val Trp Phe Asp Arg Phe Lys Ile Thr Asn Asp Cys 
-85                 -80                 -75                 -70 
cca gaa cac ctt gaa tca att gat gtc atg tgt caa gtg ctt act gat 333 
Pro Glu His Leu Glu Ser Ile Asp Val Met Cys Gln Val Leu Thr Asp 
                -65                 -60                 -55 
ttg att gat gaa gaa gta aaa agt ggc atc aag aag aac agg ata tta 381 
Leu Ile Asp Glu Glu Val Lys Ser Gly Ile Lys Lys Asn Arg Ile Leu 
            -50                 -45                 -40 
ata gga gga ttc tct atg gga gga tgc atg gca atg cat tta gca tat 429 
Ile Gly Gly Phe Ser Met Gly Gly Cys Met Ala Met His Leu Ala Tyr 
        -35                 -30                 -25 
aga aat cat caa gat gtg gca gga gta ttt gct ctt tct agt ttt ctg 477 
Arg Asn His Gln Asp Val Ala Gly Val Phe Ala Leu Ser Ser Phe Leu 
    -20                 -15                 -10 
aat aaa gca tct gct gtt tac cag gct ctt cag aag agt aat ggt gta 525 
Asn Lys Ala Ser Ala Val Tyr Gln Ala Leu Gln Lys Ser Asn Gly Val 
-5                  1               5                   10 
ctt cct gaa tta ttt cag tgt cat ggt act gca gat gag tta gtt ctt 573 
Leu Pro Glu Leu Phe Gln Cys His Gly Thr Ala Asp Glu Leu Val Leu 
            15                  20                  25 
cat tct tgg gca gaa gag aca aac tca atg tta aaa tct cta gga gtg 621 
His Ser Trp Ala Glu Glu Thr Asn Ser Met Leu Lys Ser Leu Gly Val 
        30                  35                  40 
acc acg aag ttt cat agt ttt cca aat gtt tac cat gag cta agc aaa 669 
Thr Thr Lys Phe His Ser Phe Pro Asn Val Tyr His Glu Leu Ser Lys 
    45                  50                  55 
act gag tta gac ata ttg aag tta tgg att ctt aca aag ctg cca gga 717 
Thr Glu Leu Asp Ile Leu Lys Leu Trp Ile Leu Thr Lys Leu Pro Gly 
60                  65                  70                  75 
gaa atg gaa aaa caa aaa tgaatgaatc aagagtgatt tgttaatgta 765 
Glu Met Glu Lys Gln Lys 
                80 
agtgtaatgt ctttgtgaaa agtgattttt actgccaaat tataatgata attaaaatat 825 
taagaaatag caaaaaaaaa aaaaaaa 852 

<210> 10 
<211> 169 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SIGNAL 
<222> -88..-1 
<223> score 6.7 
      seq VFALSSFLNKASA/VY 

<400> 10 
Met Lys Gly Gly Ile Ser Asn Val Trp Phe Asp Arg Phe Lys Ile Thr 
            -85                 -80                 -75 
Asn Asp Cys Pro Glu His Leu Glu Ser Ile Asp Val Met Cys Gln Val 
        -70                 -65                 -60 
Leu Thr Asp Leu Ile Asp Glu Glu Val Lys Ser Gly Ile Lys Lys Asn 
    -55                 -50                 -45 
Arg Ile Leu Ile Gly Gly Phe Ser Met Gly Gly Cys Met Ala Met His 
-40                 -35                 -30                 -25 
Leu Ala Tyr Arg Asn His Gln Asp Val Ala Gly Val Phe Ala Leu Ser 
                -20                 -15                 -10 
Ser Phe Leu Asn Lys Ala Ser Ala Val Tyr Gln Ala Leu Gln Lys Ser 
            -5                  1               5 
Asn Gly Val Leu Pro Glu Leu Phe Gln Cys His Gly Thr Ala Asp Glu 
    10                  15                  20 
Leu Val Leu His Ser Trp Ala Glu Glu Thr Asn Ser Met Leu Lys Ser 
25                  30                  35                  40 
Leu Gly Val Thr Thr Lys Phe His Ser Phe Pro Asn Val Tyr His Glu 
                45                  50                  55 
Leu Ser Lys Thr Glu Leu Asp Ile Leu Lys Leu Trp Ile Leu Thr Lys 
            60                  65                  70 
Leu Pro Gly Glu Met Glu Lys Gln Lys 
        75                  80 



<210> 11 
<211> 1602 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 24..1004 

<221> sig_peptide 
<222> 24..170 
<223> score 5.6 
      seq ACLSLGFFSLLWL/QL 

<400> 11 
atgcgccgcc gcctctccgc acg atg ttc ccc tcg cgg agg aaa gcg gcg cag 53 
                          Met Phe Pro Ser Arg Arg Lys Ala Ala Gln 
                                          -45                 -40 
ctg ccc tgg gag gac ggc agg tcc ggg ttg ctc tcc ggc ggc ctc cct 101 
Leu Pro Trp Glu Asp Gly Arg Ser Gly Leu Leu Ser Gly Gly Leu Pro 
                -35                 -30                 -25 
cgg aag tgt tcc gtc ttc cac ctg ttc gtg gcc tgc ctc tcg ctg ggc 149 
Arg Lys Cys Ser Val Phe His Leu Phe Val Ala Cys Leu Ser Leu Gly 
            -20                 -15                 -10 
ttc ttc tcc cta ctc tgg ctg cag ctc agc tgc tct ggg gac gtg gcc 197 
Phe Phe Ser Leu Leu Trp Leu Gln Leu Ser Cys Ser Gly Asp Val Ala 
        -5                  1               5 
cgg gca gtc agg gga caa ggg cag gag acc tcg ggc cct ccc cgt gcc 245 
Arg Ala Val Arg Gly Gln Gly Gln Glu Thr Ser Gly Pro Pro Arg Ala 
10                  15                  20                  25 
tgc ccc cca gag ccg ccc cct gag cac tgg gaa gaa gac gca tcc tgg 293 
Cys Pro Pro Glu Pro Pro Pro Glu His Trp Glu Glu Asp Ala Ser Trp 
                30                  35                  40 
ggc ccc cac cgc ctg gca gtg ctg gtg ccc ttc cgc gaa cgc ttc gag 341 
Gly Pro His Arg Leu Ala Val Leu Val Pro Phe Arg Glu Arg Phe Glu 
            45                  50                  55 
gag ctc ctg gtc ttc gtg ccc cac atg cgc cgc ttc ctg agc agg aag 389 
Glu Leu Leu Val Phe Val Pro His Met Arg Arg Phe Leu Ser Arg Lys 
        60                  65                  70 
aag atc cgg cac cac atc tac gtg ctc aac cag gtg gac cac ttc agg 437 
Lys Ile Arg His His Ile Tyr Val Leu Asn Gln Val Asp His Phe Arg 
    75                  80                  85 
ttc aac cgg gca gcg ctc atc aac gtg ggc ttc ctg gag agc agc aac 485 
Phe Asn Arg Ala Ala Leu Ile Asn Val Gly Phe Leu Glu Ser Ser Asn 
90                  95                  100                 105 
agc acg gac tac att gcc atg cac gac gtt gac ctg ctc cct ctc aac 533 
Ser Thr Asp Tyr Ile Ala Met His Asp Val Asp Leu Leu Pro Leu Asn 
                110                 115                 120 
gag gag ctg gac tat ggc ttt cct gag gct ggg ccc ttc cac gtg gcc 581 
Glu Glu Leu Asp Tyr Gly Phe Pro Glu Ala Gly Pro Phe His Val Ala 
            125                 130                 135 
tcc ccg gag ctc cac cct ctc tac cac tac aag acc tat gtc ggc ggc 629 
Ser Pro Glu Leu His Pro Leu Tyr His Tyr Lys Thr Tyr Val Gly Gly 
        140                 145                 150 
atc ctg ctg ctc tcc aag cag cac tac cgg ctg tgc aat ggg atg tcc 677 
Ile Leu Leu Leu Ser Lys Gln His Tyr Arg Leu Cys Asn Gly Met Ser 
    155                 160                 165 
aac cgc ttc tgg ggc tgg ggc cgc gag gac gac gag ttc tac cgg cgc 725 
Asn Arg Phe Trp Gly Trp Gly Arg Glu Asp Asp Glu Phe Tyr Arg Arg 
170                 175                 180                 185 
att aag gga gct ggg ctc cag ctt ttc cgc ccc tcg gga atc aca act 773 
Ile Lys Gly Ala Gly Leu Gln Leu Phe Arg Pro Ser Gly Ile Thr Thr 
                190                 195                 200 
ggg tac aag aca ttt cgc cac ctg cat gac cca gcc tgg cgg aag agg 821 
Gly Tyr Lys Thr Phe Arg His Leu His Asp Pro Ala Trp Arg Lys Arg 
            205                 210                 215 
gac cag aag cgc atc gca gct caa aaa cag gag cag ttc aag gtg gac 869 
Asp Gln Lys Arg Ile Ala Ala Gln Lys Gln Glu Gln Phe Lys Val Asp 
        220                 225                 230 
agg gag gga ggc ctg aac act gtg aag tac cat gtg gct tcc cgc act 917 
Arg Glu Gly Gly Leu Asn Thr Val Lys Tyr His Val Ala Ser Arg Thr 
    235                 240                 245 
gcc ctg tct gtg ggc ggg gcc ccc tgc act gtc ctc aac atc atg ttg 965 
Ala Leu Ser Val Gly Gly Ala Pro Cys Thr Val Leu Asn Ile Met Leu 
250                 255                 260                 265 
gac tgt gac aag acc gcc aca ccc tgg tgc aca ttc agc tgagctggat 1014 
Asp Cys Asp Lys Thr Ala Thr Pro Trp Cys Thr Phe Ser 
                270                 275 
ggacagtgag gaagcctgta cctacaggcc atattgctca ggctcaggac aaggcctcag 1074 
gtcgtgggcc cagctctgac aggatgtgga gtggccagga ccaagacagc aagctacgca 1134 
attgcagcca cccggccgcc aaggcaggct tgggctgggc caggacacgt ggggtgcctg 1194 
ggacgctgct tgccatgcac agtgatcaga gagaggctgg ggtgtgtcct gtccgggacc 1254 
ccccctgcct tcctgctcac cctactctga cctccttcac gtgcccaggc ctgtgggtag 1314 
tggggagggc tgaacaggac aacctctcat cacccccact tttgttcctt cctgctgggc 1374 
tgcctcgtgc agagacacag tgtaggggcc atgcagctgg cgtaggtggc agttgggcct 1434 
ggtgagggtt aggacttcag aaaccagagc acaagcccca cagaggggga acagccagca 1494 
ccgctctagc tggttgttgc catgccggaa tgtgggccta gtgttgccag atcttctgat 1554 
ttttcgaaag aaactagaat gctggattct caaaaaaaaa aaaaaaaa 1602 

<210> 12 
<211> 327 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SIGNAL 
<222> -49..-1 
<223> score 5.6 
      seq ACLSLGFFSLLWL/QL 

<400> 12 
Met Phe Pro Ser Arg Arg Lys Ala Ala Gln Leu Pro Trp Glu Asp Gly 
                -45                 -40                 -35 
Arg Ser Gly Leu Leu Ser Gly Gly Leu Pro Arg Lys Cys Ser Val Phe 
            -30                 -25                 -20 
His Leu Phe Val Ala Cys Leu Ser Leu Gly Phe Phe Ser Leu Leu Trp 
        -15                 -10                 -5 
Leu Gln Leu Ser Cys Ser Gly Asp Val Ala Arg Ala Val Arg Gly Gln 
    1               5                   10                  15 
Gly Gln Glu Thr Ser Gly Pro Pro Arg Ala Cys Pro Pro Glu Pro Pro 
                20                  25                  30 
Pro Glu His Trp Glu Glu Asp Ala Ser Trp Gly Pro His Arg Leu Ala 
            35                  40                  45 
Val Leu Val Pro Phe Arg Glu Arg Phe Glu Glu Leu Leu Val Phe Val 
        50                  55                  60 
Pro His Met Arg Arg Phe Leu Ser Arg Lys Lys Ile Arg His His Ile 
    65                  70                  75 
Tyr Val Leu Asn Gln Val Asp His Phe Arg Phe Asn Arg Ala Ala Leu 
80                  85                  90                  95 
Ile Asn Val Gly Phe Leu Glu Ser Ser Asn Ser Thr Asp Tyr Ile Ala 
                100                 105                 110 
Met His Asp Val Asp Leu Leu Pro Leu Asn Glu Glu Leu Asp Tyr Gly 
            115                 120                 125 
Phe Pro Glu Ala Gly Pro Phe His Val Ala Ser Pro Glu Leu His Pro 
        130                 135                 140 
Leu Tyr His Tyr Lys Thr Tyr Val Gly Gly Ile Leu Leu Leu Ser Lys 
    145                 150                 155 
Gln His Tyr Arg Leu Cys Asn Gly Met Ser Asn Arg Phe Trp Gly Trp 
160                 165                 170                 175 
Gly Arg Glu Asp Asp Glu Phe Tyr Arg Arg Ile Lys Gly Ala Gly Leu 
                180                 185                 190 
Gln Leu Phe Arg Pro Ser Gly Ile Thr Thr Gly Tyr Lys Thr Phe Arg 
            195                 200                 205 
His Leu His Asp Pro Ala Trp Arg Lys Arg Asp Gln Lys Arg Ile Ala 
        210                 215                 220 
Ala Gln Lys Gln Glu Gln Phe Lys Val Asp Arg Glu Gly Gly Leu Asn 
    225                 230                 235 
Thr Val Lys Tyr His Val Ala Ser Arg Thr Ala Leu Ser Val Gly Gly 
240                 245                 250                 255 
Ala Pro Cys Thr Val Leu Asn Ile Met Leu Asp Cys Asp Lys Thr Ala 
                260                 265                 270 
Thr Pro Trp Cys Thr Phe Ser 
            275 

<210> 13 
<211> 1568 
<212> DNA 

<213> Homo sapiens 
<220>
<221> CDS
<222> 75..1259

<221> sig_peptide
<222> 75..1004
<223> score 4.4
seq VLILLFSLALIIL/PS

<400> 13 

agaaaaggtg tagtgtttgg ggcggtcaac gggctatgct ggcttgacag ggctgggctc 60 



ttcagaacag aagc atg gat ctc gga atc cct gac ctg ctg gac gcg tgg 110
Met Asp Leu Gly Ile Pro Asp Leu Leu Asp Ala Trp
-310 -305 -300

ctg gag ccc cca gag gat atc ttc tcg aca gga tcc gtc ctg gag ctg 158
Leu Glu Pro Pro Glu Asp Ile Phe Ser Thr Gly Ser Val Leu Glu Leu
-295 -290 -285

gga ctc cac tgc ccc cct cca gag gtt ccg gta act agg cta cag gaa 206
Gly Leu His Cys Pro Pro Pro Glu Val Pro Val Thr Arg Leu Gln Glu
-280 -275 -270

cag gga ctg caa ggc tgg aag tcc ggt ggg gac cgt ggc tgt ggc ctt 254
Gln Gly Leu Gln Gly Trp Lys Ser Gly Gly Asp Arg Gly Cys Gly Leu
-265 -260 -255

caa gag agt gag cct gaa gat ttc ttg aag ctt ttc att gat ccc aat 302
Gln Glu Ser Glu Pro Glu Asp Phe Leu Lys Leu Phe Ile Asp Pro Asn
-250 -245 -240 -235

gag gtg tac tgc tca gaa gca tct cct ggc agt gac agt ggc atc tct 350
Glu Val Tyr Cys Ser Glu Ala Ser Pro Gly Ser Asp Ser Gly Ile Ser
-230 -225 -220

gag gac tcc tgc cat cca gac agt ccc cct gcc ccc agg gca acc agt 398
Glu Asp Ser Cys His Pro Asp Ser Pro Pro Ala Pro Arg Ala Thr Ser
-215 -210 -205

tct cct atg ctc tat gag gtt gtc tat gag gca ggg gcc ctg gag agg 446
Ser Pro Met Leu Tyr Glu Val Val Tyr Glu Ala Gly Ala Leu Glu Arg
-200 -195 -190

atg cag ggg gaa act ggg cca aat gta ggc ctt atc tcc atc cag cta 494
Met Gln Gly Glu Thr Gly Pro Asn Val Gly Leu Ile Ser Ile Gln Leu
-185 -180 -175

gat cag tgg agc cca gca ttt atg gtg cct gat tcc tgc atg gtc agt 542
Asp Gln Trp Ser Pro Ala Phe Met Val Pro Asp Ser Cys Met Val Ser
-170 -165 -160 -155

gag ctg ccc ttt gat gct cat gcc cac atc ctg ccc aga gca ggc acc 590
Glu Leu Pro Phe Asp Ala His Ala His Ile Leu Pro Arg Ala Gly Thr
-150 -145 -140

gta gcc cca gtg ccc tgt aca acc ctg ctg ccc tgt caa acc ctg ttc 638
Val Ala Pro Val Pro Cys Thr Thr Leu Leu Pro Cys Gln Thr Leu Phe
-135 -130 -125

ctg acc gat gag gag aag cgt ctg ctg ggg cag gaa ggg gtt tcc ctg 686
Leu Thr Asp Glu Glu Lys Arg Leu Leu Gly Gln Glu Gly Val Ser Leu
-120 -115 -110

ccc tct cac ctg ccc ctc acc aag gca gag gag agg gtc ctc aag aag 734
Pro Ser His Leu Pro Leu Thr Lys Ala Glu Glu Arg Val Leu Lys Lys
-105 -100 -95

gtc agg agg aaa atc cgt aac aag cag tca gct cag gac agt cgg cgg 782
Val Arg Arg Lys Ile Arg Asn Lys Gln Ser Ala Gln Asp Ser Arg Arg
-90 -85 -80 -75

cgg aag aag gag tac att gat ggg ctg gag agc agg gtg gca gcc tgt 830
Arg Lys Lys Glu Tyr Ile Asp Gly Leu Glu Ser Arg Val Ala Ala Cys
-70 -65 -60

tct gca cag aac caa gaa tta cag aaa aaa gtc cag gag ctg gag agg 878
Ser Ala Gln Asn Gln Glu Leu Gln Lys Lys Val Gln Glu Leu Glu Arg
-55 -50 -45

cac aac atc tcc ttg gta gct cag ctc cgc cag ctg cag acg cta att 926
His Asn Ile Ser Leu Val Ala Gln Leu Arg Gln Leu Gln Thr Leu Ile
-40 -35 -30

gct caa act tcc aac aaa gct gcc cag acc agc act tgt gtt ttg att 974
Ala Gln Thr Ser Asn Lys Ala Ala Gln Thr Ser Thr Cys Val Leu Ile
-25 -20 -15

ctt ctt ttt tcc ctg gct ctc atc atc ctg ccc agc ttc agt cca ttc 1022
Leu Leu Phe Ser Leu Ala Leu Ile Ile Leu Pro Ser Phe Ser Pro Phe
-10 -5 1 5

cag agt cga cca gaa gct ggg tct gag gat tac cag cct cac gga gtg 1070
Gln Ser Arg Pro Glu Ala Gly Ser Glu Asp Tyr Gln Pro His Gly Val
10 15 20

act tcc aga aat atc ctg acc cac aag gac gta aca gaa aat ctg gag 1118
Thr Ser Arg Asn Ile Leu Thr His Lys Asp Val Thr Glu Asn Leu Glu
25 30 35

acc caa gtg gta gag tcc aga ctg agg gag cca cct gga gcc aag gat 1166
Thr Gln Val Val Glu Ser Arg Leu Arg Glu Pro Pro Gly Ala Lys Asp
40 45 50

gca aat ggc tca aca agg aca ctg ctt gag aag atg gga ggg aag cca 1214
Ala Asn Gly Ser Thr Arg Thr Leu Leu Glu Lys Met Gly Gly Lys Pro
55 60 65 70

aga ccc agt ggg cgc atc cgg tcc gtg ctg cat gca gat gag atg 1259
Arg Pro Ser Gly Arg Ile Arg Ser Val Leu His Ala Asp Glu Met
75 80 85

tgagctggaa cagaccttcc tggcccactt cctgatcaca aggaatcctg ggcttcctta 1319
tggctttctt cccactggga ttcctactta ggtgtctgcc ctcaggggtc caaatcactt 1379
caggacaccc caagagatgt cctttagtct ctgcctgagg cctagtctgc atttgtttgc 1439
atatatgaga gggtacctca aatacttctg ttatgtatct gtgattttat ttcttctttg 1499
ggtatagggt tgaggggaaa taagttttga gtgagaaata aacgttttag ctgaaaaaaa 1559
aaaaaaaaa 1568



<210> 14 

<211> 395 

<212> PRT 

<213> Homo sapiens 



<220>
<221> SIGNAL
<222> -310..-1
<223> score 4.4
seq VLILLFSLALIIL/PS

<400> 14

Met Asp Leu Gly Ile Pro Asp Leu Leu Asp Ala Trp Leu Glu Pro Pro
-310 -305 -300 -295

Glu Asp Ile Phe Ser Thr Gly Ser Val Leu Glu Leu Gly Leu His Cys
-290 -285 -280

Pro Pro Pro Glu Val Pro Val Thr Arg Leu Gln Glu Gln Gly Leu Gln
-275 -270 -265

Gly Trp Lys Ser Gly Gly Asp Arg Gly Cys Gly Leu Gln Glu Ser Glu
-260 -255 -250

Pro Glu Asp Phe Leu Lys Leu Phe Ile Asp Pro Asn Glu Val Tyr Cys
-245 -240 -235

Ser Glu Ala Ser Pro Gly Ser Asp Ser Gly Ile Ser Glu Asp Ser Cys
-230 -225 -220 -215

His Pro Asp Ser Pro Pro Ala Pro Arg Ala Thr Ser Ser Pro Met Leu
-210 -205 -200

Tyr Glu Val Val Tyr Glu Ala Gly Ala Leu Glu Arg Met Gln Gly Glu
-195 -190 -185

Thr Gly Pro Asn Val Gly Leu Ile Ser Ile Gln Leu Asp Gln Trp Ser
-180 -175 -170

Pro Ala Phe Met Val Pro Asp Ser Cys Met Val Ser Glu Leu Pro Phe
-165 -160 -155

Asp Ala His Ala His Ile Leu Pro Arg Ala Gly Thr Val Ala Pro Val
-150 -145 -140 -135

Pro Cys Thr Thr Leu Leu Pro Cys Gln Thr Leu Phe Leu Thr Asp Glu
-130 -125 -120

Glu Lys Arg Leu Leu Gly Gln Glu Gly Val Ser Leu Pro Ser His Leu
-115 -110 -105

Pro Leu Thr Lys Ala Glu Glu Arg Val Leu Lys Lys Val Arg Arg Lys
-100 -95 -90

Ile Arg Asn Lys Gln Ser Ala Gln Asp Ser Arg Arg Arg Lys Lys Glu
-85 -80 -75

Tyr Ile Asp Gly Leu Glu Ser Arg Val Ala Ala Cys Ser Ala Gln Asn
-70 -65 -60 -55

Gln Glu Leu Gln Lys Lys Val Gln Glu Leu Glu Arg His Asn Ile Ser
-50 -45 -40

Leu Val Ala Gln Leu Arg Gln Leu Gln Thr Leu Ile Ala Gln Thr Ser
-35 -30 -25

Asn Lys Ala Ala Gln Thr Ser Thr Cys Val Leu Ile Leu Leu Phe Ser
-20 -15 -10

Leu Ala Leu Ile Ile Leu Pro Ser Phe Ser Pro Phe Gln Ser Arg Pro
-5 1 5 10

Glu Ala Gly Ser Glu Asp Tyr Gln Pro His Gly Val Thr Ser Arg Asn
15 20 25

Ile Leu Thr His Lys Asp Val Thr Glu Asn Leu Glu Thr Gln Val Val
30 35 40

Glu Ser Arg Leu Arg Glu Pro Pro Gly Ala Lys Asp Ala Asn Gly Ser
45 50 55

Thr Arg Thr Leu Leu Glu Lys Met Gly Gly Lys Pro Arg Pro Ser Gly
60 65 70

Arg Ile Arg Ser Val Leu His Ala Asp Glu Met
75 80 85

<210> 15 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<400> 15 

gggaagatgg agatagtatt gcctg 25 



<210> 16 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<400> 16 

ctgccatgta catgatagag agattc 26 

<210> 17 
<211> 546 
<212> DNA 

<213> Homo Sapiens 



<220>
<221> promoter
<222> 1..517

<221> transcription start site
<222> 518

<221> protein_bind
<222> 17..25
<223> matinspector prediction
name CMYB_01
score 0.983
sequence tgtcagttg

<221> protein_bind
<222> complement (18..27)
<223> matinspector prediction
name MYOD_Q6
score 0.961
sequence cccaactgac

<221> protein_bind
<222> complement (75..85)
<223> matinspector prediction
name S8_01
score 0.960
sequence aatagaattag

<221> protein_bind
<222> 94..104
<223> matinspector prediction
name S8_01
score 0.966
sequence aactaaattag

<221> protein_bind
<222> complement (129..139)
<223> matinspector prediction
name DELTAEF1_01
score 0.960
sequence gcacacctcag

<221> protein_bind
<222> complement (155..165)
<223> matinspector prediction
name GATA_C
score 0.964
sequence agataaatcca

<221> protein_bind
<222> 170..178
<223> matinspector prediction
name CMYB_01
score 0.958
sequence cttcagttg

<221> protein_bind
<222> 176..189
<223> matinspector prediction
name GATA1_02
score 0.959
sequence ttgtagataggaca

<221> protein_bind
<222> 180..190
<223> matinspector prediction
name GATA_C
score 0.953
sequence agataggacat

<221> protein_bind
<222> 284..299
<223> matinspector prediction
name TAL1ALPHAE47_01
score 0.973
sequence cataacagatggtaag

<221> protein_bind
<222> 284..299
<223> matinspector prediction
name TAL1BETAE47_01
score 0.983
sequence cataacagatggtaag

<221> protein_bind
<222> 284..299
<223> matinspector prediction
name TAL1BETAITF2_01
score 0.978
sequence cataacagatggtaag

<221> protein_bind
<222> complement (287..296)
<223> matinspector prediction
name MYOD_Q6
score 0.954
sequence accatctgtt

<221> protein_bind
<222> complement (302..314)
<223> matinspector prediction
name GATA1_04
score 0.953
sequence tcaagataaagta

<221> protein_bind
<222> 393..405
<223> matinspector prediction
name IK1_01
score 0.963
sequence agttgggaattcc

<221> protein_bind
<222> 393..404
<223> matinspector prediction
name IK2_01
score 0.985
sequence agttgggaattc

<221> protein_bind
<222> 396..405
<223> matinspector prediction
name CREL_01
score 0.962
sequence tgggaattcc

<221> protein_bind
<222> 423..436
<223> matinspector prediction
name GATA1_02
score 0.950
sequence tcagtgatatggca

<221> protein_bind
<222> complement (478..489)
<223> matinspector prediction
name SRY_02
score 0.951
sequence taaaacaaaaca

<221> protein_bind
<222> 486..493
<223> matinspector prediction
name E2F_02
score 0.957
sequence tttagcgc

<221> protein_bind
<222> complement (514..521)
<223> matinspector prediction
name MZF1_01
score 0.975
sequence tgagggga



<400> 17 

tgagtgcagt gttacatgtc agttgggtta agtttgttaa tgtcattcaa atcttctatg 60 

tcttgatttg cctgctaatt ctattatttc tggaactaaa ttagtttgat ggttctatta 120 

gttattgact gaggtgtgct aatctcccat tatgtggatt tatctatttc ttcagttgta 180 

gataggacat tgatagatac ataagtacca ggacaaaagc agggagatct tttttccaaa 240 

atcaggagaa aaaaatgaca tctggaaaac ctatagggaa aggcataaca gatggtaagg 300 

atactttatc ttgagtagga gagccttcct gtggcaacgt ggagaaggga agaggtcgta 360 

gaattgagga gtcagctcag ttagaagcag ggagttggga attccgttca tgtgatttag 420 

catcagtgat atggcaaatg tgggactaag ggtagtgatc agagggttaa aattgtgtgt 480 

tttgttttag cgctgctggg gcatcgcctt gggtcccctc aaacagattc ccatgaatct 540 

cttcat 546 



<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<400> 18 

gtaccaggga ctgtgaccat tgc 23 

<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<400> 19 

ctgtgaccat tgctcccaag agag 24 

<210> 20 
<211> 861 
<212> DNA 

<213> Homo Sapiens 



<220>
<221> promoter
<222> 1..806

<221> transcription start site
<222> 807

<221> protein_bind
<222> complement (60..70)
<223> matinspector prediction
name NFY_Q6
score 0.956
sequence ggaccaatcat

<221> protein_bind
<222> 70..77
<223> matinspector prediction
name MZF1_01
score 0.962
sequence cctgggga

<221> protein_bind
<222> 124..132
<223> matinspector prediction
name CMYB_01
score 0.994
sequence tgaccgttg

<221> protein_bind
<222> complement (126..134)
<223> matinspector prediction
name VMYB_02
score 0.985
sequence tccaacggt

<221> protein_bind
<222> 135..143
<223> matinspector prediction
name STAT_01
score 0.968
sequence ttcctggaa

<221> protein_bind
<222> complement (135..143)
<223> matinspector prediction
name STAT_01
score 0.951
sequence ttccaggaa

<221> protein_bind
<222> complement (252..259)
<223> matinspector prediction
name MZF1_01
score 0.956
sequence ttggggga

<221> protein_bind
<222> 357..368
<223> matinspector prediction
name IK2_01
score 0.965
sequence gaatgggatttc

<221> protein_bind
<222> 384..391
<223> matinspector prediction
name MZF1_01
score 0.986
sequence agagggga

<221> protein_bind
<222> complement (410..421)
<223> matinspector prediction
name SRY_02
score 0.955
sequence gaaaacaaaaca

<221> protein_bind
<222> 592..599
<223> matinspector prediction
name MZF1_01
score 0.960
sequence gaagggga

<221> protein_bind
<222> 618..627
<223> matinspector prediction
name MYOD_Q6
score 0.981
sequence agcatctgcc

<221> protein_bind
<222> 632..642
<223> matinspector prediction
name DELTAEF1_01
score 0.958
sequence tcccaccttcc

<221> protein_bind
<222> complement (813..823)
<223> matinspector prediction
name S8_01
score 0.992
sequence gaggcaattat

<221> protein_bind
<222> complement (824..831)
<223> matinspector prediction
name MZF1_01
score 0.986
sequence agagggga

<221> misc_feature
<222> 335,376
<223> n=a, g, c or t



<400> 20 

tactataggg cacgcgtggt cgacggccgg gctgttctgg agcagagggc atgtcagtaa 60 

tgattggtcc ctggggaagg tctggctggc tccagcacag tgaggcattt aggtatctct 120 

cggtgaccgt tggattcctg gaagcagtag ctgttctgtt tggatctggt agggacaggg 180 

ctcagagggc taggcacgag ggaaggtcag aggagaaggs aggsarggcc cagtgagarg 240 

ggagcatgcc ttcccccaac cctggcttsc ycttggymam agggcgktty tgggmacttr 300 

aaytcagggc ccaascagaa scacaggccc aktcntggct sraaagcacaa tagcctgaat 360 

gggatttcag gttagncagg gtgagagggg aggctctctg gcttagtttt gttttgtttt 420 

ccaaatcaag gtaacttgct cccttctgct acgggccttg gtcttggctt gtcctcaccc 480 

agtcggaact ccctaccact ttcaggagag tggttttagg cccgtggggc tgttctgttc 540 

caagcagtgt gagaacatgg ctggtagagg ctctagctgt gtgcggggcc tgaaggggag 600 

tgggttctcg cccaaagagc atctgcccat ttcccacctt cccttctccc accagaagct 660 

tgcctgagct gtttggacaa aaatccaaac cccacttggc tactctggcc tggcttcagc 720 

ttggaaccca atacctaggc ttacaggcca tcctgagcca ggggcctctg gaaattctct 780 

tcctgatggt cctttaggtt tgggcacaaa atataattgc ctctcccctc tcccattttc 840 

tctcttggga gcaatggtca c 861 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<400> 21 

ctgggatgga aggcacggta 20 

<210> 22 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<400> 22

gagaccacac agctagacaa 20

<210> 23 
<211> 555 
<212> DNA 

<213> Homo Sapiens 
<220>
<221> promoter
<222> 1..500

<221> transcription start site
<222> 501

<221> protein_bind
<222> 191..206
<223> matinspector prediction
name ARNT_01
score 0.964
sequence ggactcacgtgctgct

<221> protein_bind
<222> 193..204
<223> matinspector prediction
name NMYC_01
score 0.965
sequence actcacgtgctg

<221> protein_bind
<222> 193..204
<223> matinspector prediction
name USF_01
score 0.985
sequence actcacgtgctg

<221> protein_bind
<222> complement (193..204)
<223> matinspector prediction
name USF_01
score 0.985
sequence cagcacgtgagt

<221> protein_bind
<222> complement (193..204)
<223> matinspector prediction
name NMYC_01
score 0.956
sequence cagcacgtgagt

<221> protein_bind
<222> complement (193..204)
<223> matinspector prediction
name MYCMAX_02
score 0.972
sequence cagcacgtgagt

<221> protein_bind
<222> 195..202
<223> matinspector prediction
name USF_C
score 0.997
sequence tcacgtgc

<221> protein_bind
<222> complement (195..202)
<223> matinspector prediction
name USF_C
score 0.991
sequence gcacgtga

<221> protein_bind
<222> complement (210..217)
<223> matinspector prediction
name MZF1_01
score 0.968
sequence catgggga

<221> protein_bind
<222> 397..410
<223> matinspector prediction
name ELK1_02
score 0.963
sequence ctctccggaagcct

<221> protein_bind
<222> 400..409
<223> matinspector prediction
name CETS1P54_01
score 0.974
sequence tccggaagcc

<221> protein_bind
<222> complement (460..470)
<223> matinspector prediction
name AP1_Q4
score 0.963
sequence agtgactgaac

<221> protein_bind
<222> complement (460..470)
<223> matinspector prediction
name AP1FJ_Q2
score 0.961
sequence agtgactgaac

<221> protein_bind
<222> 547..555
<223> matinspector prediction
name PADS_C
score 1.000
sequence tgtggtctc

<400> 23 

ctatagggca cgcktggtcg acggcccggg ctggtctggt ctgtkgtgga gtcgggttga 60 

aggacagcat ttgtkacatc tggtctactg caccttccct ctgccgtgca cttggccttt 120 

kawaagctca gcaccggtgc ccatcacagg gccggcagca cacacatccc attactcaga 180 

aggaactgac ggactcacgt gctgctccgt ccccatgagc tcagtggacc tgtctatgta 240

gagcagtcag acagtgcctg ggatagagtg agagttcagc cagtaaatcc aagtgattgt 300 

cattcctgtc tgcattagta actcccaacc tagatgtgaa aacttagttc tttctcatag 360 

gttgctctgc ccatggtccc actgcagacc caggcactct ccggaagcct ggaaatcacc 420 

cgtgtcttct gcctgctccc gctcacatcc cacacttgtg ttcagtcact gagttacaga 480 

ttttgcctcc tcaatttctc ttgtcttagt cccatcctct gttcccctgg ccagtttgtc 540 

tagctgtgtg gtctc 555 



<210> 24 




<211> 251 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 13..249

<221> sig_peptide
<222> 13..81
<223> Von Heijne matrix
score 11.8000001907349
seq CLFVCLFLSQSFA/FV

<400> 24

aaaagtattg gg atg cct agt tac aar gtg tgt ggg gtt ttt tgt ttg ttt 51
Met Pro Ser Tyr Lys Val Cys Gly Val Phe Cys Leu Phe
-20 -15

gtt tgt ttg ttt ttg agc cag agt ttt gct ttt gtc ctc cag gct gga 99
Val Cys Leu Phe Leu Ser Gln Ser Phe Ala Phe Val Leu Gln Ala Gly
-10 -5 1 5

gtg cag tgg cgc gat ctc tgc tca ctg caa cct cag ctt ccc agg ttc 147
Val Gln Trp Arg Asp Leu Cys Ser Leu Gln Pro Gln Leu Pro Arg Phe
10 15 20

ggg cca tcc tcc tgc ctc agc ctc cca agt ggc tgg gac tgc agg cgc 195
Gly Pro Ser Ser Cys Leu Ser Leu Pro Ser Gly Trp Asp Cys Arg Arg
25 30 35

cca cca cca cgc ctg gct aat tct tgt gtt ttc ggt gga gac ggg gtt 243
Pro Pro Pro Arg Leu Ala Asn Ser Cys Val Phe Gly Gly Asp Gly Val
40 45 50

tca ccg gg 251
Ser Pro
55



<210> 25 

<211> 274 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 35..274

<221> sig_peptide
<222> 35..82
<223> Von Heijne matrix
score 14.8000001907349
seq SLPLLLLLLGAWA/IP

<400> 25

acagactaca cttgctgaac tggctcctgg ggcc atg agg ctg tca ctg cca ctg 55
Met Arg Leu Ser Leu Pro Leu
-15 -10

ctg ctg ctg ctg ctg gga gcc tgg gcc atc cca ggg ggc ctc ggg gac 103
Leu Leu Leu Leu Leu Gly Ala Trp Ala Ile Pro Gly Gly Leu Gly Asp
-5 1 5

agg gcg cca ctc aca gcc aca gcc cca caa ctg gat gat gag gag atg 151
Arg Ala Pro Leu Thr Ala Thr Ala Pro Gln Leu Asp Asp Glu Glu Met
10 15 20

tac tca gcc cac atg ccc gct cac ctg cgc tgt gat gcc tgc aga gct 199
Tyr Ser Ala His Met Pro Ala His Leu Arg Cys Asp Ala Cys Arg Ala
25 30 35

gtg gct tac cag gtg agt cct tca cca ctg tca cct gcc ctg ctc aca 247
Val Ala Tyr Gln Val Ser Pro Ser Pro Leu Ser Pro Ala Leu Leu Thr
40 45 50 55

ccc ctt ctc aag cca gcc ccc acc ggg 274
Pro Leu Leu Lys Pro Ala Pro Thr Gly
60



<210> 26 

<211> 230 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 29..229

<221> sig_peptide
<222> 29..94
<223> Von Heijne matrix
score 13.8000001907349
seq LGLLLLWLRGARC/GV

<400> 26

aaggagtcag tctcagtcag gacacagc atg gac atg agg gtc ccc gct cag 52
Met Asp Met Arg Val Pro Ala Gln
-20 -15

ctc ctg ggg ctc ctg cta ctc tgg ctc cga ggt gcc aga tgt ggc gtc 100
Leu Leu Gly Leu Leu Leu Leu Trp Leu Arg Gly Ala Arg Cys Gly Val
-10 -5 1

cag atg acc cag ttt cca ctg tcc ctg tct gca tcg gta gga gac aga 148
Gln Met Thr Gln Phe Pro Leu Ser Leu Ser Ala Ser Val Gly Asp Arg
5 10 15

gtc acc atc act tgc cgg aca agc cat ata att aac atc ttt tta aat 196
Val Thr Ile Thr Cys Arg Thr Ser His Ile Ile Asn Ile Phe Leu Asn
20 25 30

tgg tat cag cag aaa cca ggc aaa gcc cct tgg g 230
Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Trp
35 40 45



<210> 27 

<211> 195 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 44..193

<221> sig_peptide
<222> 44..112
<223> Von Heijne matrix
score 13.8000001907349
seq VLLLLLLSGDVQS/SE

<400> 27

agagggcttc cggggctgcc ggtctgagtg cagagctgct gtc atg gcg gcc gct 55
Met Ala Ala Ala
-20

ctg tgg ggc ttc ttt ccc gtc ctg ctg ctg ctg ctg cta tcg ggg gat 103
Leu Trp Gly Phe Phe Pro Val Leu Leu Leu Leu Leu Leu Ser Gly Asp
-15 -10 -5

gtc cag agc tcg gag gtg ccc ggg gct gct gct gag gga tcg gga ggg 151
Val Gln Ser Ser Glu Val Pro Gly Ala Ala Ala Glu Gly Ser Gly Gly
1 5 10

agt ggg gtc ggc ata gga gak cgc ttc aag att gag gga ctg gg 195
Ser Gly Val Gly Ile Gly Xaa Arg Phe Lys Ile Glu Gly Leu
15 20 25



<210> 28 

<211> 276 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 25..276

<221> sig_peptide
<222> 25..90
<223> Von Heijne matrix
score 13.5
seq LGLLLLWLXGARC/DI

<400> 28

agtcagtctc agacaggaca cagc atg gac atg agg gtc ccc gct cag ctc 51
Met Asp Met Arg Val Pro Ala Gln Leu
-20 -15

ctg ggg ctc ctg cta ctc tgg ctc yka ggt gcc aga tgt gac atc cag 99
Leu Gly Leu Leu Leu Leu Trp Leu Xaa Gly Ala Arg Cys Asp Ile Gln
-10 -5 1

atg aca cag tct cca gtc ctg cct gca tct gta gga gac aga gtc acc 147
Met Thr Gln Ser Pro Val Leu Pro Ala Ser Val Gly Asp Arg Val Thr
5 10 15

atc act tgc cgg gca agt cag agc att ggc agc tat tta aac tgg tat 195
Ile Thr Cys Arg Ala Ser Gln Ser Ile Gly Ser Tyr Leu Asn Trp Tyr
20 25 30 35

cag cat aaa cca ggg cat gcc cct cgc ctc ctg atc tat gct gca act 243
Gln His Lys Pro Gly His Ala Pro Arg Leu Leu Ile Tyr Ala Ala Thr
40 45 50

act ttg tcg agg ggc ggs ccg gcc aga ttc agt 276
Thr Leu Ser Arg Gly Gly Pro Ala Arg Phe Ser
55 60



<210> 29 
<211> 240 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 25..240

<221> sig_peptide
<222> 25..120
<223> Von Heijne matrix
score 13.5
seq LLLLLLLPPPGSC/AG

<400> 29

agggcgctgc gcggcgcagc gaaa atg gcg gct tcc agg tgg gcg cgc aag 51
Met Ala Ala Ser Arg Trp Ala Arg Lys
-30 -25

gcc gtg gtc ctg ctt tgt gcc tct gac ctg ctg ctg ctg ctg cta ctg 99
Ala Val Val Leu Leu Cys Ala Ser Asp Leu Leu Leu Leu Leu Leu Leu
-20 -15 -10

cta cca ccg cct ggg tcc tgc gcc ggc cga agg tcg ccy dgg acg ccc 147
Leu Pro Pro Pro Gly Ser Cys Ala Gly Arg Arg Ser Pro Xaa Thr Pro
-5 1 5

gac gag tct acc cca cct ccc cgg aag aag aag aag gat att cgc gat 195
Asp Glu Ser Thr Pro Pro Pro Arg Lys Lys Lys Lys Asp Ile Arg Asp
10 15 20 25

tac aat gat gca gac atg gcg cgt ctt ctg gag caa ggg gag ggg 240
Tyr Asn Asp Ala Asp Met Ala Arg Leu Leu Glu Gln Gly Glu Gly
30 35 40

<210> 30 
<211> 461 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 80..460

<221> sig_peptide
<222> 80..136
<223> Von Heijne matrix
score 13.5
seq WVLLLALLEGVQC/DV

<221> misc_feature
<222> 280..281,311..313
<223> n=a, g, c or t

<400> 30

agctctcaga gaggtgcctt agccctggat tccaaggcat ttccacttgg tgatcagcac 60
tgaacacaga ggactcacc atg gag ttg ggg ctg tgc tgg gtt ctc ctt tta 112
Met Glu Leu Gly Leu Cys Trp Val Leu Leu Leu
-15 -10

gct ctt tta gaa ggt gtc caa tgt gac gtg gaa tta gtg gag tct ggg 160
Ala Leu Leu Glu Gly Val Gln Cys Asp Val Glu Leu Val Glu Ser Gly
-5 1 5

ggc ggc ttg gtg cag cct ggs ggg tct ctg aga ctt tcc tgt gca gcc 208
Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala
10 15 20

tct gga ttc aat ttt agc act tat gag atg cat tgg atc cgc cag gct 256
Ser Gly Phe Asn Phe Ser Thr Tyr Glu Met His Trp Ile Arg Gln Ala
25 30 35 40

cca ggg aag ggg ccg gag tgg gtn nca tat gtc agt ggt gga ggt gga 304
Pro Gly Lys Gly Pro Glu Trp Val Xaa Tyr Val Ser Gly Gly Gly Gly
45 50 55

acc agh nnn aac gcv sac tct gtg aag ggc cga ttc acc atc tcc aga 352
Thr Xaa Xaa Asn Ala Xaa Ser Val Lys Gly Arg Phe Thr Ile Ser Arg
60 65 70

gac aat gcc aac agt ttt gtg tat cta caa atg gac agt ctg cga gtc 400
Asp Asn Ala Asn Ser Phe Val Tyr Leu Gln Met Asp Ser Leu Arg Val
75 80 85

gag gac acc gct ctc tat tac tgt gcg aga rgg gat tac gac ttc tgg 448
Glu Asp Thr Ala Leu Tyr Tyr Cys Ala Arg Xaa Asp Tyr Asp Phe Trp
90 95 100

agt ggt tat tat a 461
Ser Gly Tyr Tyr
105

<210> 31 
<211> 112 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 28..111

<221> sig_peptide
<222> 28..84
<223> Von Heijne matrix
score 13.3999996185303
seq LLLLLSHCTGSLS/QP

<400> 31

aactgtgcat gtcaggctgt gtccacc atg gcc tgg act cct ctt ctt ctc ttg 54
Met Ala Trp Thr Pro Leu Leu Leu Leu
-15

ctc ctc tct cac tgc aca ggt tcc ctc tcc cag cct gtg ctg act cag 102
Leu Leu Ser His Cys Thr Gly Ser Leu Ser Gln Pro Val Leu Thr Gln
-10 -5 1 5

cca cgc ggg g 112
Pro Arg Gly

<210> 32 
<211> 445 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 80..445

<221> sig_peptide
<222> 80..136
<223> Von Heijne matrix
score 12.8000001907349
seq WVFLVALLRGVQC/QV

<221> misc_feature
<222> 2,7
<223> n=a, g, c or t

<400> 32

anctctngga gaggagccca gcactagaag tcggcggtgt ttccattcgg tgatcagcac 60
tgaacacaga ggactcacc atg gag ttt ggg ctg aat tgg gtt ttc ctc gtt 112
Met Glu Phe Gly Leu Asn Trp Val Phe Leu Val
-15 -10

gct ctt tta aga ggt gtc cag tgt cag gtt cag ttg gtg gag tct ggg 160
Ala Leu Leu Arg Gly Val Gln Cys Gln Val Gln Leu Val Glu Ser Gly
-5 1 5

gga ggc gtg gtc cag cct ggg acg tcc ctg aca ctt tcc tgt gca ggc 208
Gly Gly Val Val Gln Pro Gly Thr Ser Leu Thr Leu Ser Cys Ala Gly
10 15 20

tcg gga ttc agt ttc agt gat tat ggc atc cac tgg gtc cgc cag gct 256
Ser Gly Phe Ser Phe Ser Asp Tyr Gly Ile His Trp Val Arg Gln Ala
25 30 35 40

cca ggc aag ggg ctg gaa tgg gtg gcg gtt att tca cac gat gga aat 304
Pro Gly Lys Gly Leu Glu Trp Val Ala Val Ile Ser His Asp Gly Asn
45 50 55

aac aaa tat tat gga ggc tcc atg aag ggc cga gtc acc atc tcc aga 352
Asn Lys Tyr Tyr Gly Gly Ser Met Lys Gly Arg Val Thr Ile Ser Arg
60 65 70

gac aac tcc agg cat acc gtg tct ttg caa atg agc agc ttg gga cct 400
Asp Asn Ser Arg His Thr Val Ser Leu Gln Met Ser Ser Leu Gly Pro
75 80 85

gag gac acg gca gtg tat tac tgt gcg aaa gat cga acc ggg ggg 445
Glu Asp Thr Ala Val Tyr Tyr Cys Ala Lys Asp Arg Thr Gly Gly
90 95 100

<210> 33 

<211> 321 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 41..319

<221> sig_peptide
<222> 41..97
<223> Von Heijne matrix
score 12.6000003814697
seq FLLLLAAPRWVLS/QV

<400> 33

aaatacttts kgcagagtcc tggacctcct gtgcaagaac atg aaa ctt ctg tgg 55
Met Lys Leu Leu Trp
-15

ttc ttc ctt ctc ctg ctg gca gct ccc aga tgg gtc ctg tcc cag gtg 103
Phe Phe Leu Leu Leu Leu Ala Ala Pro Arg Trp Val Leu Ser Gln Val
-10 -5 1

cag ctg gtg smg tcg ggc cca gga ctg gtg aag cct tcg ggg acc ctg 151
Gln Leu Val Xaa Ser Gly Pro Gly Leu Val Lys Pro Ser Gly Thr Leu
5 10 15

tcc cta acg tgc act gts ksb ggk grs ksc ata act aat tac tac tgg 199
Ser Leu Thr Cys Thr Val Xaa Gly Xaa Xaa Ile Thr Asn Tyr Tyr Trp
20 25 30

agt bgg atc cgg cag tcc cca ggg aag gga ctg gag tgg att ggg act 247
Ser Xaa Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu Trp Ile Gly Thr
35 40 45 50

atc tac tac agt ggg agc gcc gac cac aac ccc tcc ctc agg agt mga 295
Ile Tyr Tyr Ser Gly Ser Ala Asp His Asn Pro Ser Leu Arg Ser Arg
55 60 65

gcc act att tca tta gac acg cgc gg 321
Ala Thr Ile Ser Leu Asp Thr Arg
70

<210> 34 

<211> 193 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 49..192

<221> sig_peptide
<222> 49..108
<223> Von Heijne matrix
score 12.5
seq LLXLLTALPPLWS/SS

<400> 34

agagctcagg gtgckgagcg tgtgaccagc agtgagcaga ggccggcc atg gcc agc 57
Met Ala Ser
-20

ctg ggg ctg ctg ctc ctg ckc tta ctg aca gca ctg cca ccg ctg tgg 105
Leu Gly Leu Leu Leu Leu Xaa Leu Leu Thr Ala Leu Pro Pro Leu Trp
-15 -10 -5

tcc tcc tca ctg cct ggg ctg gac ack gct gaa agt aaa gcc acc akt 153
Ser Ser Ser Leu Pro Gly Leu Asp Thr Ala Glu Ser Lys Ala Thr Xaa
1 5 10 15

gca gac ctg atc ctg tct gcg ctg gag aga gcc acc ggg g 193
Ala Asp Leu Ile Leu Ser Ala Leu Glu Arg Ala Thr Gly
20 25

<210> 35 
<211> 438 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 151. .438 

<221> sig_peptide 
<222> 151. .234 
<223> Von Heijne matrix 
score 12 . 5 

seq LLLLLLLPLRGQA/NT 

<400> 35 

acgagaaggg gagggggccc agccctgctt tgggcaatcc ttgctctgac cactcagaca 60 
ccgtgtcctc ttgcctggga gaggggaagc agatctgagg acatctctgt gccaggccag 120 
aaaccgccca cctgcagttc cttctccggg atg gac gtg ggg ccc age tec ctg 174 

Met Asp Val Gly Pro Ser Ser Leu 
-25 

ccc cac ctt ggg ctg aag ctg ctg ctg etc ctg ctg ctg ctg ccc etc 222 

Pro His Leu Gly Leu Lys Leu Leu Leu Leu Leu Leu Leu Leu Pro Leu 

-20 -15 -10 -5 

^99 99c caa gcc aac aca ggc tgc tac ggg ate eea ggg atg ccc gge 270 

Arg Gly Gin Ala Asn Thr Gly Cys Tyr Gly lie Pro Gly Met Pro Gly 

1 5 10
ctg ccc ggg gca cca ggg aag gat ggg tac gac gga ctg ecg ggg ccc 318 
Leu Pro Gly Ala Pro Gly Lys Asp Gly Tyr Asp Gly Leu Pro Gly Pro 

15 20 25 

aag ggg gag eea gga ate eea gee att ccc ggg ate ega gga ccc aaa 366 
Lys Gly Glu Pro Gly lie Pro Ala lie Pro Gly lie Arg Gly Pro Lys 

30 35 40 

ggg cag aag gga gaa ccc ggc tta ccc ggc cat ect ggg aaa aat ggc 414 
Gly Gin Lys Gly Glu Pro Gly Leu Pro Gly His Pro Gly Lys Asn Gly 
45 50 55 60 

ccc atg gga ccc cct ggg atg cca 438 
Pro Met Gly Pro Pro Gly Met Pro 

65 

<210> 36 
<211> 488 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .487 

<221> sig_peptide

<222> 59. .115 

<223> Von Heijne matrix 

score 12.3999996185303 

seq ILLLVAAATGTHA/QV 

<221> misc_feature 
<222> 26. .28 
<223> n=a, g, c or t

<400> 36 



atcacacaac agmcacatcs swmvsnnnmc agaagccccc agagtgcagc acctcacc 58

d L. ^ y acc tgg agg atc ctc ctc ttg gtg gca gca gct aca ggc 106
Thr Trp Arg Ile Leu Leu Leu Val Ala Ala Ala Thr Gly
-15 -10 -5

acc cac gcc cag gtc cag tta ata cag tct ggg cct gag gtg aaa aag 154
Thr His Ala Gln Val Gln Leu Val Gln Ser Gly Pro Glu Val Lys Lys
1 5 10

cct gyy gcc J. gtg aag at" r* y UL, tcc tgc cag gtt tcc gga tac aac gtc 202
Ser Val Lys Val Cys Gln Val Ser Gly Tyr Asn Val
25

gtg gsLSi t- 1* s tcc atc cac L.yy y i-y cgt cag tcg cct aaa y aaa ggg ctt 250
Val uxii liSU Ser Ile His Val Arg Gln Ser Pro Gly Lys Gly Leu
35 40 45

gag tgg atg gg^ ggt ttt gac L. L. gaa agt ggt gaa aca atc tac gca 298
Glu Trp Met Gly Gly Phe Asp Leu Glu Ser Gly Glu Thr Ile Tyr Ala
50 55 60

cag agg ttc cag ggc aga atc acc atg acc gag gac tca tct tca gac 346
Gln Arg Phe Gln Gly Arg Ile Thr Met Thr Glu Asp Ser Ser Ser Asp
65 70 75

aca gcc ttc atg gag ctg atc agc ctg aga cct gaa gat gcg gcc gtc 394
Thr Ala Phe Met Glu Leu Ile Ser Leu Arg Pro Glu Asp Ala Ala Val
80 85 90

tac tac tgt gca acg atc cgg ctg cca gta gtg ctt ttt ttc gcg gct 442
Tyr Tyr Cys Ala Thr Ile Arg Leu Pro Val Val Leu Phe Phe Ala Ala
95 100 105

tct ggg gcc agg gaa ccc tgg tcg ccg tct cct cag cmt cca cgg g 488
Ser Gly Ala Arg Glu Pro Trp Ser Pro Ser Pro Gln Xaa Pro Arg
110 115 120

<210> 37 

<211> 138 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 26. .136 

<221> sig_peptide

<222> 26. .79 

<223> Von Heijne matrix 

score 12.1000003814697 

seq VLLLAVLLLAVLC/KV 

<400> 37 

ttttacccga cccgacgccg gcgtg atg tgg ctt ccg ctg gtg ctg etc ctg 52 

Met Trp Leu Pro Leu Val Leu Leu Leu 
-15 -10 
get gtg ctg ctg ctg gcc gtc etc tgc aaa gtt tac ttg gga eta ttc 100 
Ala Val Leu Leu Leu Ala Val Leu Cys Lys Val Tyr Leu Gly Leu Phe 

-5 1 5
tct ggc age tec ccg aat cct ttc tec gaa gaa agg gg 138 
Ser Gly Ser Ser Pro Asn Pro Phe Ser Glu Glu Arg 
10 15 

<210> 38 
<211> 163 
<212> DNA 

<213> Homo sapiens 
<220>
<221> CDS 
<222> 9. .161 



<221> sig_peptide 
<222> 9. .83 

<223> Von Heijne matrix 

score 11.8999996185303 
seq WLLLLPLLLGLNA/GA 



<400> 38 

aacttgtc atg gag ctg gca ctg egg cgc tct ccc gtc ccg egg tgg ttg 50 
Met Glu Leu Ala Leu Arg Arg Ser Pro Val Pro Arg Trp Leu 
-25 -20 -15 

ctg ctg ctg ccg ctg ctg ctg ggc ctg aac gca gga get gtc att gac 98 
Leu Leu Leu Pro Leu Leu Leu Gly Leu Asn Ala Gly Ala Val lie Asp 

-10 -5 1 5 

tgg ccc aca gag gag ggc aag gaa gta tgg gat tat gtg acg gtc cgc 146 
Trp Pro Thr Glu Glu Gly Lys Glu Val Trp Asp Tyr Val Thr Val Arg 

10 15 20 

aag gat gcc tac atg gg 163 
Lys Asp Ala Tyr Met 
25 



<210> 39 

<211> 427 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 35. .427 



<221> sig_peptide

<222> 35. .91 

<223> Von Heijne matrix 

score 11.8999996185303 
seq FLFLLTCCPGSNS/QA 

<221> misc_feature 
<222> 138. .139 
<223> n=a, g, c or t 



<400> 39 

tctggcacca ggggtccctt ccaatatcag cacc atg gcc tgg act cct ctc ttt 55
Met Ala Trp Thr Pro Leu Phe
-15

ctg ttc ctc ctc act tgc tgc cca ggg tcc aat tcc cag gct gtg gkg 103
Leu Phe Leu Leu Thr Cys Cys Pro Gly Ser Asn Ser Gln Ala Val Xaa
-10 -5 1

act cag gag ccc ctc act gac tgt gtc ccc cgg ann aca gtc act ctc 151
Thr Gln Glu Pro Leu Thr Asp Cys Val Pro Arg Xaa Thr Val Thr Leu
5 10 15 20

acc tgt ggc tcc agt att gga gct gtc acc aat ggt cat ttt ccc tac 199
Thr Cys Gly Ser Ser Ile Gly Ala Val Thr Asn Gly His Phe Pro Tyr
25 30 35

tgg ttc caa cag aag cct ggc caa gcc ccc agg aca ctg att tct gat 247
Trp Phe Gln Gln Lys Pro Gly Gln Ala Pro Arg Thr Leu Ile Ser Asp
40 45 50

acg ttc aac aga cag tcc tcg aca cct gcc cgc ttc tct ggc tcc ctc 295
Thr Phe Asn Arg Gln Ser Ser Thr Pro Ala Arg Phe Ser Gly Ser Leu
55 60 65

WO 99/53051
PCT/IB99/00712

ctg ggg ggc aaa gct gtc ctg act ctt tcg gat gcg caa cct gac gat 343
Leu Gly Gly Lys Ala Val Leu Thr Leu Ser Asp Ala Gln Pro Asp Asp
70 75 80

gag gct gaa tat tat tgt gtc ctc tcc tat agt ggt ggt cgg ccg gtg 391
Glu Ala Glu Tyr Tyr Cys Val Leu Ser Tyr Ser Gly Gly Arg Pro Val
85 90 95 100

ttc ggc gga ggg acc aag ctg acc gtc cta agt cag 427
Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Ser Gln
105 110



<210> 40 

<211> 97 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 22. .96 



<221> sig_peptide 

<222> 22. .84 

<223> Von Heijne matrix 

score 11.8999996185303 
seq LALCLLLGPLAGA/KP 



<400> 40 

agatcaggaa gcaccgggaa g atg cag gcc tgc atg gtg ccg ggg ctg gcc 51 

Met Gin Ala Cys Met Val Pro Gly Leu Ala 
-20 -15 
etc tgc etc eta ctg ggg cet ett gea ggg gee aag ect gtg cag g 97 
Leu Cys Leu Leu Leu Gly Pro Leu Ala Gly Ala Lys Pro Val Gin 
-10 -5 1 



<210> 41 

<211> 536 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 266. .535 



<221> sig_peptide 
<222> 266. .307 
<223> Von Heijne matrix 
score 15 

seq LLPLLLLLPMCWA/VE 



<400> 41 

acttttgggg teaegtgctc attccgtttc ectacctccc ccaaccttat cccgcccctg 60 
ggggttegeg ggcattttte aggaaettte tttecggctt gagaagcegc cactcccaag 120 
atgsageagg aaccgegget gctggaeaag aggggtgcgg tggataetga cetttgctce 180 
ggectcgtcg tgaagacaca gcgcatctcc ccgctgtagg cttcctccca cagaacccgt 240 
ttcgggcctc agagcgtctg gtgag atg ctg ttg ccg ctg ctg ctg ctg eta 292 

Met Leu Leu Pro Leu Leu Leu Leu Leu 
-10 

ccc atg tgc tgg gcc gtg gag gtc aag agg ccc egg ggc gtc tec etc 340 

Pro Met Cys Trp Ala Val Glu Val Lys Arg Pro Arg Gly Val Ser Leu 

-5 1 5 10

aee aat eat cac ttc tac gat gag tee aag ect tte ace tgc ctg gae 388 

Thr Asn His His Phe Tyr Asp Glu Ser Lys Pro Phe Thr Cys Leu Asp 

15 20 25 

ggt teg gcc ace ate cca ttt gat cag gtc aac gat gac tat tgc gae 436 
Gly Ser Ala Thr lie Pro Phe Asp Gin Val Asn Asp Asp Tyr Cys Asp 




30 35 40 

tgc aaa gat ggc tct gac gag cca ggc acg get gcc tgt cct aat ggc 484 
Cys Lys Asp Gly Ser Asp Glu Pro Gly Thr Ala Ala Cys Pro Asn Gly 
45 50 55 

age ttc cac tgc acc aac act ggc tat aag ccc ctg tat ate cec tec 532 

Ser Phe His Cys Thr Asn Thr Gly Tyr Lys Pro Leu Tyr He Pro Ser 

60 65 70 75 

aac c 536 

Asn 



<210> 42 
<211> 319 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 143 . . 319 



<221> sig_peptide 

<222> 143 . .205 

<223> Von Heijne matrix 

score 11.6000003814697 
seq LLLCLALSGAAET/KP 

<221> misc_feature
<222> 139 

<223> n=a, g, c or t 



<400> 42 

ageagaggga acagggaaga aacctaaagg ctgcaggctg ccaggtgtgc ttggagagcc 60 
cccttcttcc gccgggcctc gcaagcagcg taggactgtg gagaagggcg gtgggcaagg 120 
agggaacteg agagcarcny cc atg ggc aca eag gag ggc tgg wgc ctg ctg 172 

Met Gly Thr Gin Glu Gly Trp Xaa Leu Leu 
-20 -15 
etc tgc ctg get eta tct gga gca gca gaa acc aag ccc cac cca gca 220 
Leu Cys Leu Ala Leu Ser Gly Ala Ala Glu Thr Lys Pro His Pro Ala 
-10 -5 1 5

999 cag tgg egg gca gtg gdc gtg gtc eta gac ygt ttc ctg gtg 268 
Glu Gly Gin Trp Arg Ala Val Xaa Val Val Leu Asp Xaa Phe Leu Val 

10 15 20 

aag gac svt gcg cac cgt gga get etc gcc age agt gag gac agg gca 316 
Lys Asp Xaa Ala His Arg Gly Ala Leu Ala Ser Ser Glu Asp Arg Ala 

25 30 35 

agg 319 
Arg 



<210> 43 
<211> 412 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 35. .412 



<221> sig_peptide 

<222> 35. .82 

<223> Von Heijne matrix 

score 11.1999998092651 
seq LWFLLLWGVTWG/PV 



<221> misc_feature
<222> 148

<223> n=a, g, c or t 



<400> 43 

agacactcac tgcaccggag tgagcgcgac catc atg tcc atg ctc gtg gtc ttt 55
Met Ser Met Leu Val Val Phe
-15 -10

ctc ttg ctg tgg ggt gtc acc tgg ggc cca gtg aca gaa gca gcc ata 103
Leu Leu Leu Trp Gly Val Thr Trp Gly Pro Val Thr Glu Ala Ala Ile
-5

ttt tat gag acg cag scc agc ctg tgg gca gag tcc gaa cac tgn ctg 151
Phe Tyr Glu Thr Gln Xaa Ser Leu Trp Ala Glu Ser Glu His Xaa Leu
10 15 20

aaa acc ctt ggc caa tgt gac gct gac gtg cca ggc ccg cct gga gac 199
Lys Thr Leu Gly Gln Cys Asp Ala Asp Val Pro Gly Pro Pro Gly Asp
25 30 35

tcc aga ctt cca gct gtt caa gaa tgg ggg gcc cag gag cct gtg cac 247
Ser Arg Leu Pro Ala Val Gln Glu Trp Gly Ala Gln Glu Pro Val His
40 45 50 55

ctt gac tca cct gcc atc aag cac cag ttc ctg ctg acg ggt gac acc 295
Leu Asp Ser Pro Ala Ile Lys His Gln Phe Leu Leu Thr Gly Asp Thr
60 65 70

cag ggc cgc tac cgc tgc cgc tcg ggc ttg tcc aca gga tgg mcc cag 343
Gln Gly Arg Tyr Arg Cys Arg Ser Gly Leu Ser Thr Gly Trp Xaa Gln
75 80 85

ctg agc aag ctc ctg gag ctg aca ggg cca aaa gtc ctt gcc tgc tcc 391
Leu Ser Lys Leu Leu Glu Leu Thr Gly Pro Lys Val Leu Ala Cys Ser
90 95 100

ctg gct ctc gat gcc agt 412
Leu Ala Leu Asp Gly Ala Ser
105 110













<210> 44 

<211> 331 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 32. .331 



<221> sig_peptide

<222> 32. .88 

<223> Von Heijne matrix 

score 11.1999998092651 
seq IGPLLLWVPASRG/EI 



<400> 44 

atgagcaaaa ctgacaagtc aaggcaggaa g atg ttg cca tca caa ctc att 52
Met Leu Pro Ser Gln Leu Ile
-15

ggg ttt ctg ctg ctc tgg gtt cca gcc tcc agg ggt gaa att gtg ctg 100
Gly Phe Leu Leu Leu Trp Val Pro Ala Ser Arg Gly Glu Ile Val Leu
-10 -5 1

act cag tct cca gac ttt ctg tct gtg act cca aag gag aaa gtc acc 148
Thr Gln Ser Pro Asp Phe Leu Ser Val Thr Pro Lys Glu Lys Val Thr
5 10 15 20

atc acc tgc cgg gcc agt sag agc att ggt agt agt tta tac tgg tac 196
Ile Thr Cys Arg Ala Ser Xaa Ser Ile Gly Ser Ser Leu Tyr Trp Tyr
25 30 35

cag cag aaa cca cat cag tct cca aag ctc gtc atc aag tat gct tcc 244
Gln Gln Lys Pro His Gln Ser Pro Lys Leu Val Ile Lys Tyr Ala Ser
40 45 50

cag tcc ttc tca ggg gtc tcc tcg agg ttc agt ggc agt gga tct ggg 292
Gln Ser Phe Ser Gly Val Ser Ser Arg Phe Ser Gly Ser Gly Ser Gly
55 60 65

aca gat ttc acc ctc aca atc aat agc ctg gaa cct ggg 331
Thr Asp Phe Thr Leu Thr Ile Asn Ser Leu Glu Pro Gly
70 75 80



<210> 45 

<211> 520 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 176. .520 

<221> sig_peptide 

<222> 176. .235 

<223> Von Heijne matrix 

score 11.1999998092651 
seq AFLLLVALSYTLA/RD 



<400> 45 

gaagataatc acttggggaa aggaaggttc gtttctgagt tagcaacaag taaatgcagc 60 
actagtgggt gggattgagg tatgccctgg tgcataaata gagactcagc tgtgctggca 120 
cactcagaag cttggacegc atcctagccg ccgactcaca caaggcagag ttgce atg 178 

Met 
-20 

gag aaa att eca gtg tea gca ttc ttg etc ett gtg gee etc tee tac 226 
Glu Lys lie Pro Val Ser Ala Phe Leu Leu Leu Val Ala Leu Ser Tyr 

-15 -10 -5 

act ctg gcc aga gat acc aca gtc aaa cct gga gcc aaa aag gac aca 274 
Thr Leu Ala Arg Asp Thr Thr Val Lys Pro Gly Ala Lys Lys Asp Thr 

1 5 10

aag gac tct ega cee aaa ctg ccc cag acc etc tec aga ggt tgg ggt 322 
Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser Arg Gly Trp Gly 

15 20 25 

gac caa etc ate tgg act cag aca tat gaa gaa get eta tat aaa tec 370 
Asp Gin Leu lie Trp Thr Gin Thr Tyr Glu Glu Ala Leu Tyr Lys Ser 
30 35 40 45 

aag aca age aac aaa ccc ttg atg att att cat cae ttg gat gag tge 418 
Lys Thr Ser Asn Lys Pro Leu Met lie lie His His Leu Asp Glu Cys 

50 55 60 

eca cae agt caa get tta aag aaa gtg ttt get gaa aat aaa gaa ate 466 
Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu Asn Lys Glu lie 

65 70 75 

cag aaa ttg gca gag cag ttt gtc etc etc aat ctg gtt tat gaa aca 514 
Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu Val Tyr Glu Thr 

80 85 90 

act gac 520 
Thr Asp 
95 



<210> 46 
<211> 383 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 25. .381 



<221> sig_peptide
<222> 25..84

<223> Von Heijne matrix 

score 11.1000003814697 

seq LLALLFFLGQAAG/DL 

<400> 46 

agcggctcca gctaagagga caag atg agg ccc ggc etc tea ttt etc eta 51 

Met Arg Pro Gly Leu Ser Phe Leu Leu 
-20 -15 
gee ctt ctg ttc ttc ctt ggc caa get gea ggg gat ttg ggg gat gtg 99 
Ala Leu Leu Phe Phe Leu Gly Gin Ala Ala Gly Asp Leu Gly Asp Val 

-10 -5 1 5

gga cct cca att ccc age ccc ggc ttc age tet ttc cca ggt gtt gac 147 
Gly Pro Pro lie Pro Ser Pro Gly Phe Ser Ser Phe Pro Gly Val Asp 

10 15 20 

tec age tec age ttc age tee age tec agg teg ggc tec age tee age 195 
Ser Ser Ser Ser Phe Ser Ser Ser Ser Arg Ser Gly Ser Ser Ser Ser 

25 30 35 

cgc age tta ggc age gga ggt tct gtg tec cag ttg ttt tec aat ttc 243 
Arg Ser Leu Gly Ser Gly Gly Ser Val Ser Gin Leu Phe Ser Asn Phe 

40 45 50 

ace ggc tee gtg gat gac cgt ggg ace tge cag tge tet gtt tec ctg 291 
Thr Gly Ser Val Asp Asp Arg Gly Thr Cys Gin Cys Ser Val Ser Leu 

55 60 65 

cca gac ace acc ttt ccc gtg gac aga gtg gaa cgc ttg gaa ttc aca 339 
Pro Asp Thr Thr Phe Pro Val Asp Arg Val Glu Arg Leu Glu Phe Thr 
70 75 80 85 

gct cat gtt ctt tct cag aag ttt gag aaa gaa ctt tct aaa gc 383

Ala His Val Leu Ser Gin Lys Phe Glu Lys Glu Leu Ser Lys 
90 95 

<210> 47 

<211> 459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 17. .457 

<221> sig_peptide

<222> 17. .94 

<223> Von Heijne matrix 

score 11.1000003814697 

seq FLLLVAAPRWVRS/QV 

<221> misc_feature
<222> 399 

<223> n=a, g, c or t 
<400> 47 

atactttctg agactc atg gac ctc ctg cac aag aac atg aaa cac ctg tgg 52
Met Asp Leu Leu His Lys Asn Met Lys His Leu Trp
-25 -20 -15

ttc ttc ctc ctc ctg gtg gca gct ccc aga tgg gtc cgg tct car gtg 100
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Arg Ser Gln Val
-10 -5 1

cag ctg cak gag tcg ggc cca gga ctg gtg aag cct tcg ggg acc ctg 148
Gln Leu Xaa Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Gly Thr Leu
5 10 15

tcc ctc atc tgc ggt gtc tct ggt gat tcc gtc acc att agt ggt tgg 196
Ser Leu Ile Cys Gly Val Ser Gly Asp Ser Val Thr Ile Ser Gly Trp
20 25 30

tgg agt tgg gtc cgc cag ccc cca ggg aag gga ctg gag tgg att tcg 244
Trp Ser Trp Val Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Ser
35 40 45 50

gaa atc gat cat ggt gga aac acc aat tac aac ccg tcc ctc aag agt 292
Glu Ile Asp His Gly Gly Asn Thr Asn Tyr Asn Pro Ser Leu Lys Ser
55 60 65

cga gtc kcc att tct tta gac aag tcc aag aat aag ttc tcc ctg agg 340
Arg Val Xaa Ile Ser Leu Asp Lys Ser Lys Asn Lys Phe Ser Leu Arg
70 75 80

ctg acc tct gtg acc gcc gcg gac acc gcc atg tat kac tgt gcg aga 388
Leu Thr Ser Val Thr Ala Ala Asp Thr Ala Met Tyr Xaa Cys Ala Arg
85 90 95

ggc ggt gcg bnc agc tcg tcc gct ttt gat gtc tgg ggc cta rgg aca 436
Gly Gly Ala Xaa Ser Ser Ser Ala Phe Asp Val Trp Gly Leu Xaa Thr
100 105 110

atg gtc atc atc tct tca gcc tc 459
Met Val Ile Ile Ser Ser Ala
115 120



<210> 48 
<211> 437 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 20..436



<221> sig_peptide
<222> 20. .76 
<223> Von Heijne matrix 
score 11 

seq TLLLLTVPSWVLS/QV 



<400> 48 

gtgaatcctg ctctccace atg gac ata ett tgt tec aeg etc ctg ctm ctg 52 

Met Asp He Leu Cys Ser Thr Leu Leu Leu Leu 
-15 -10 
ack gtc ccg tec tgg gtc tta tec car gtc acc ttg arg gaa tct ggt 100 
Thr Val Pro Ser Trp Val Leu Ser Gin Val Thr Leu Xaa Glu Ser Gly 

-5 1 5
cct gcg ctg gtg aaa gcc aca cag acc etc aga ctg acc tgc acc ttc 148 
Pro Ala Leu Val Lys Ala Thr Gin Thr Leu Arg Leu Thr Cys Thr Phe 

10 15 20 

tct ggg ttc tea etc age act aat aga atg cgt gtg agt tgg ate cgt 196 
Ser Gly Phe Ser Leu Ser Thr Asn Arg Met Arg Val Ser Trp He Arg 
25 30 35 40 

cag ccc cca ggg aag gee ctg gag tgg ett gea egg att gat tgg gat 244 
Gin Pro Pro Gly Lys Ala Leu Glu Trp Leu Ala Arg He Asp Trp Asp 

45 50 55 

gat tat aag agg tac age aca tct ctg aag acc agg gtc acc ate tec 292 
Asp Tyr Lys Arg Tyr Ser Thr Ser Leu Lys Thr Arg Val Thr He Ser 

60 65 70 

aag gac acg tec aaa aac cag gtg ate ctg aca atg acc aac gtg gac 340 
Lys Asp Thr Ser Lys Asn Gin Val He Leu Thr Met Thr Asn Val Asp 

75 80 85 

cct gcg gac aca gcc acc tat tac tgt gca cgc ctt tca acg gca gct 388

Pro Ala Asp Thr Ala Thr Tyr Tyr Cys Ala Arg Leu Ser Thr Ala Ala 

90 95 100 

acc cca cag ttt ttt gac ttc tgg ggc cag gga gtc ctg gte tec gtc t 437 
Thr Pro Gin Phe Phe Asp Phe Trp Gly Gin Gly Val Leu Val Ser Val 
105 110 115 120 
<210> 49

<211> 456 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 40. .456 



<221> sig_peptide 

<222> 40..96

<223> Von Heijne matrix 

score 10.8999996185303 
seq FLLLVAAPRWVLS/QV 



<400> 49 

aaatactttc tgagagtcct ggacctcctg tgcaagaac atg adw cat ctg tgg 54 

Met Xaa His Leu Trp 

-15 

ttc ttc ctt ctc ctg gtg gca gct ccc aga tgg gtc ctg tcc cag gtg 102
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val
-10 -5 1

cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg kwg acc ctg 150
Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Xaa Thr Leu
5 10 15

tcc ctc acc tgc act gtc tct ggt gac tcc atc agt agt tac tac tgg 198
Ser Leu Thr Cys Thr Val Ser Gly Asp Ser Ile Ser Ser Tyr Tyr Trp
20 25 30

agc tgg atc cgg cag ccc cca ggg aag gga ctg gag tgg att ggc tat 246
Ser Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly Tyr
35 40 45 50

atc tat tac agt ggg agc acc aac tac aac ccc tcc ctc aag agt cga 294
Ile Tyr Tyr Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg
55 60 65

gtc acc ata tca gtg gac acg tcc aag aac caa ttc tcc ctg aag ctg 342
Val Thr Ile Ser Val Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys Leu
70 75 80

agc tct gtg acc gca gcg gac acg gcc gtg tat tac tgt gcg aga sgg 390
Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Xaa
85 90 95

ctg cma tac tat gat agg agt ggt tat ttc aga tat ttt gac tac tgg 438
Leu Xaa Tyr Tyr Asp Arg Ser Gly Tyr Phe Arg Tyr Phe Asp Tyr Trp
100 105 110

ggc cag gga acc tgg tca 456
Gly Gln Gly Thr Trp Ser
115 120



<210> 50 

<211> 447 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 38. .445 



<221> sig_peptide 

<222> 38. .94 

<223> Von Heijne matrix 

score 10.8999996185303 
seq FLLLVAAPRWVLS/QV 



<221> misc_feature

<222> 16 

<223> n=a, g, c or t 
<400> 50 

atactttctg agagtnctgg acctcctgtg caagaac atg aaa cat ctg tgg ttc 55 

Met Lys His Leu Trp Phe 
-15 



ttc ctc ctc ctg gtg gca gct ccc aga tgg gtc ctg tcc cag gtg cag 103
Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val Gln
-10 -5 1

ctg cag gag tcg ggc cca gga ctg gtg aag cct tca cag acc ctg tcc 151
Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Gln Thr Leu Ser
5 10 15

ctc acc tgc aca gtc tct ggt ggc tcc atc gac agt ggt aat tac tac 199
Leu Thr Cys Thr Val Ser Gly Gly Ser Ile Asp Ser Gly Asn Tyr Tyr
20 25 30 35

tgg agc tgg atc cgg cag ccc gcc ggg aag gga ctg gag tgg att ggg 247
Trp Ser Trp Ile Arg Gln Pro Ala Gly Lys Gly Leu Glu Trp Ile Gly
40 45 50

cgc atc tat agt act ggg agc acc aat tac aac ccc tcc ctc agc agt 295
Arg Ile Tyr Ser Thr Gly Ser Thr Asn Tyr Asn Pro Ser Leu Ser Ser
55 60 65

cga gtc cag ata tcg tta gac acg tcc aag aac ctg ctc tcc ttg aac 343
Arg Val Gln Ile Ser Leu Asp Thr Ser Lys Asn Leu Leu Ser Leu Asn
70 75 80

ctg acc tct gtg acc gcc gca gac acg gcc gtc tat ttt tgt gcg cga 391
Leu Thr Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Phe Cys Ala Arg
85 90 95

acc ttc ccc ttc tac tgg tac ctc gat ctc tgg ggc cgt ggc atc ctg 439
Thr Phe Pro Phe Tyr Trp Tyr Leu Asp Leu Trp Gly Arg Gly Ile Leu
100 105 110 115

gtc act gt 447
Val Thr

<210> 51 

<211> 466 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 38. .466 

<221> sig_peptide

<222> 38. .94 

<223> Von Heijne matrix 

score 10.8999996185303 
seq FLLLVAAPRWVLS/QV 

<221> misc_feature 
<222> 423 

<223> n=a, g, c or t



<400> 51 

atactttctg agagtcctgg acctcctgtg caagaac atg aaa cac ctg tgg ttc 55 

Met Lys His Leu Trp Phe 

-15 

ttc ctc ctg ctg gtg gca gct ccc aga tgg gtc ctg tcc cag gtg cag 103
Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val Gln
-10 -5 1

ctg cag gag tcg ggc cca aga ctg gtg aag cct tca cag acc ctg tcc 151
Leu Gln Glu Ser Gly Pro Arg Leu Val Lys Pro Ser Gln Thr Leu Ser
5 10 15

ctc acc act ate tct ggt ggc tcc atc agc agt ggt ggt tac ttc 199
Leu Thr Cys Thr Val Ser Gly Gly Ser Ile Ser Ser Gly Gly Tyr Phe
20 25 30 35

a.at taa ^yy atc cgc caa cac cca ggg cgg ggc ctg gag tgg att ggc 247
Trp Ser Trp Ile Arg Gln His Pro Gly Arg Gly Leu Glu Trp Ile Gly
40 45 50

tac atc tat tac aat tgg agc acc tac tac aat ccg tcc ctc agg agt 295
Tyr Ile Tyr Tyr Asn Trp Ser Thr Tyr Tyr Asn Pro Ser Leu Arg Ser
55 60 65

cga gtt acc atg tca atg gac acg tct aag aac cag ttc tcc ctg aac 343
Arg Val Thr Met Ser Met Asp Thr Ser Lys Asn Gln Phe Ser Leu Asn
70 75 80

ctg aac tct gta act gcc gcg gac acg gsc atg tat tac tgt gcs aga 391
Leu Asn Ser Val Thr Ala Ala Asp Thr Xaa Met Tyr Tyr Cys Ala Arg
85 90 95

ggt cgc gga cgc ctt ggc tgg ttc ash mct tng ggg mca ggg rac cca 439
Gly Arg Gly Arg Leu Gly Trp Phe Xaa Xaa Xaa Gly Xaa Gly Xaa Pro
100 105 110 115

ggt cac cgt ctc atc agc cgt cca ggg 466
Gly His Arg Leu Ile Ser Arg Pro Gly
120



<210> 52 

<211> 392 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 59. .391 



<221> sig_peptide

<222> 59. .115 

<223> Von Heijne matrix 

score 10.8999996185303 
seq FLLLVAAPRWVLS/QV 

<221> misc_feature 
<222> 342 

<223> n=a, g, c or t 



<400> 52 

agggtcctgc tcacatggga aatactttct gagagtcctg gacctcctgt gcaagaac 58
atg aaa cac ctg tgg ttc ttc ctc ctg ctg gtg gca gct ccc aga tgg 106
Met Lys His Leu Trp Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp
-15 -10 -5

gtc ctg tcc cag gtg cag ctg cag gag tcg ggc cca gga ctg gtg aag 154
Val Leu Ser Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys
1 5 10

cct tca gag acc ctg tcc ctc acc tgc act gtc tct ggt ggc tcc atc 202
Pro Ser Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Ile
15 20 25

agg act ggt tct tac tac tgg act tgg gtt cgc cag ccc ccc ggg aag 250
Arg Thr Gly Ser Tyr Tyr Trp Thr Trp Val Arg Gln Pro Pro Gly Lys
30 35 40 45

ggc ctg gag tgg att ggc tac att tat tat act ggg gac acc tac tac 298
Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr Thr Gly Asp Thr Tyr Tyr
50 55 60

aac ccg tcc ctc aag agt cga att acc atg tcr cta gac acg tny wag 346
Asn Pro Ser Leu Lys Ser Arg Ile Thr Met Ser Leu Asp Thr Xaa Xaa
65 70 75

aac cag ttc kcc ctg agc ctg acc tct gtg act gtc gca gac acg g 392
Asn Gln Phe Xaa Leu Ser Leu Thr Ser Val Thr Val Ala Asp Thr
80 85 90

<210> 53 
<211> 172 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14 . .172 

<221> sig_peptide 
<222> 14. .58 
<223> Von Heijne matrix 

score 10.8999996185303 

seq LSVCLLLVTLALC/CY 

<400> 53 

aaaacaagcc acc atg aag ctg tcg gtg tgt ctc ctg ctg gtc acg ctg 49
Met Lys Leu Ser Val Cys Leu Leu Leu Val Thr Leu
-15 -10 -5

gcc ctc tgc tgc tac cag gcc aat gcc gag ttc tgc cca gct ctt gtt 97
Ala Leu Cys Cys Tyr Gln Ala Asn Ala Glu Phe Cys Pro Ala Leu Val
1 5 10

tct gag ctg tta gac ttc ttc ttc att agt gaa cct ctg ttc aag tta 145
Ser Glu Leu Leu Asp Phe Phe Phe Ile Ser Glu Pro Leu Phe Lys Leu
15 20 25

agt ctt gcc aaa ttt gat gcc cct cga 172
Ser Leu Ala Lys Phe Asp Ala Pro Arg
30 35

<210> 54
<211> 259
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 190..258

<221> sig_peptide
<222> 190..237
<223> Von Heijne matrix

score 10.8999996185303

seq VLLVLSLSQCLLS/DP

<400> 54

tacctggaaa gaacagaaat ttgttaattt acaggtctga aggtgagaaa tctgaaatta 60
gtcttacaaa actaaaatga agttgttgga agccttggct ccttctggag gttccagggg 120
aaaaaagtat gtttccttga ctttccagcc kstacaggcc cacagcattc ctgcttgcag 180
ccctatgtc atg tca cct gtc ctc ttg gtg ctg tca ttg tca caa tgc ctt 231
Met Ser Pro Val Leu Leu Val Leu Ser Leu Ser Gln Cys Leu
-15 -10 -5

ctt tct gac cct gtc att cct ggc ctc c 259
Leu Ser Asp Pro Val Ile Pro Gly Leu
1 5



<210> 55 
<211> 320 
<212> DNA 



<213> Homo sapiens

<220>
<221> CDS 
<222> 40. .318 

<221> sig_peptide 

<222> 40. .96 

<223> Von Heijne matrix 

score 10.8999996185303 

seq FLLLVAAPRWVLS/QV 

<400> 55 

aaacacsyyc tgagagtcct ggacctcctg tgcaggacc atg aaa cat ctg tgg 54 

Met Lys His Leu Trp 
-15 



ttc ttc ctt ctc ctg gtg gca gct ccc aga tgg gtc ctg tcc cag gtg 102
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val
-10 -5 1

cgg ctg cag gag tcg ggc cca cgg ctg gtg aag cct tcg gag amc ctg 150
Arg Leu Gln Glu Ser Gly Pro Arg Leu Val Lys Pro Ser Glu Xaa Leu
5 10 15

tcc ctc acc tgt agt gtc tct ggt gtc tcc gtc act aat ttc ttc tgg 198
Ser Leu Thr Cys Ser Val Ser Gly Val Ser Val Thr Asn Phe Phe Trp
20 25 30

aac tgg atc cgg aag ccc cca ggc aag ggc ctg gag tgg ctt ggc tac 246
Asn Trp Ile Arg Lys Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Tyr
35 40 45 50

atg tct tat ggc gtg agc aca aac tat cac ccc gcc tac cag agt cgg 294
Met Ser Tyr Gly Val Ser Thr Asn Tyr His Pro Ala Tyr Gln Ser Arg
55 60 65

gtc agt ata tcg att gac acg tgg gg 320
Val Ser Ile Ser Ile Asp Thr Trp
70



<210> 56 
<211> 457 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .455 

<221> sig_peptide

<222> 39. .95 

<223> Von Heijne matrix 

score 10.8999996185303 

seq FLLLVAAPRWVLS/QV 

<400> 56 

aatactttct gagagtcctg gacctcctgt gcaagaac atg aaa cat ctg tgg ttc 56
Met Lys His Leu Trp Phe
-15

ttc ctt ctc ctg gtg gca gct ccc aga tgg gtc ctg tcc cag gtg cag 104
Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val Gln
-10 -5 1

ctg cag gag gcg ggc cca cga ctg gtg aag cct tcg gag gcc ctg tcc 152
Leu Gln Glu Ala Gly Pro Arg Leu Val Lys Pro Ser Glu Ala Leu Ser
5 10 15

ctc acc tgc act gtc tct ggt gtc tcc agc agc aat tac gac tgg agt 200
Leu Thr Cys Thr Val Ser Gly Val Ser Ser Ser Asn Tyr Asp Trp Ser
20 25 30 35

tgg att cgg cag gcc cca ggg aag gga ctg gaa tgg att ggg tat ata 248
Trp Ile Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile Gly Tyr Ile
40 45 50

gac gat agt aag aat aga ggg acg acc tac aac ccc tcc ctc aag 296
Asp Asp Ser Lys Asn Arg Gly Ser Thr Thr Tyr Asn Pro Ser Leu Lys
55 60 65

cga gtc acc ata tcg stg gac acg tcc aag ast cag ttg tcc ctg 344
Ser Arg Val Thr Ile Ser Xaa Asp Thr Ser Lys Xaa Gln Leu Ser Leu
70 75 80

agg ctg acc tct gtg acc kcs gca gac acg gcc gtc tat tat tgt gcg 392
Arg Leu Thr Ser Val Thr Xaa Ala Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95

aga aag tca tct atg cat agt agt ggc tgg cat aac ^99 agt ctc tac 440
Arg Lys Ser Ser Met His Ser Ser Gly Trp His Asn Arg Ser Leu Tyr
100 105 110 115

tgg tac ttc gat cct gg 457
Trp Tyr Phe Asp Pro
120

<210> 57

<211> 420

<212> DNA

<213> Homo sapiens



<220>
<221> CDS
<222> 17..418

<221> sig_peptide
<222> 17..94
<223> Von Heijne matrix
score 10.8999996185303
seq FLLLVAAPRWVLS/QV

<400> 57
atactctctg agactc atg gac ctc ctg cac aag aac atg aaa gac ctg tgg 52
Met Asp Leu Leu His Lys Asn Met Lys Asp Leu Trp
-25 -20 -15

ttc ttc ctc ctc ctg gtg gca gct ccc aga tgg gtc ctg tct cag gtg 100
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val
-10 -5 1

ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg ggg acc ctg tcc 148
Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Gly Thr Leu Ser
5 10 15

ctc acc tgc gct gtc tct ggt ggc tcc atc ata agt agt aat tgg tgg 196
Leu Thr Cys Ala Val Ser Gly Gly Ser Ile Ile Ser Ser Asn Trp Trp
20 25 30

agt tgg gtc cgc cag acc cca ggg aag ggg ctg gag tgg att ggg gaa 244
Ser Trp Val Arg Gln Thr Pro Gly Lys Gly Leu Glu Trp Ile Gly Glu
35 40 45 50

atc tat gaa gat ggg atc acc aac tac aac ccg tcc ctc aag agt cga 292
Ile Tyr Glu Asp Gly Ile Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg
55 60 65

gtc atc att tca gtg gac aag gcc aag aac cag ttc tcc ctg aag atg 340
Val Ile Ile Ser Val Asp Lys Ala Lys Asn Gln Phe Ser Leu Lys Met
70 75 80

agg tct gtg acc gcc tcg gac acg gcc gtc tat tac tgt gcg aga ggt 388
Arg Ser Val Thr Ala Ser Asp Thr Ala Val Tyr Tyr Cys Ala Arg Gly
85 90 95

agc agc tcg gtt cgg aca gac tac tgg ggc ca 420
Ser Ser Ser Val Arg Thr Asp Tyr Trp Gly
100 105

<210> 58
<211> 469
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 38..469

<221> sig_peptide
<222> 38..94
<223> Von Heijne matrix
score 10.8999996185303
seq FLLLVAAPRWVLS/QV

<400> 58
atactttctg agagtcctgg acctcctgtg caagaac atg aaa cac ctg tgg ttc 55
Met Lys His Leu Trp Phe
-15

ttc ctc ctg ctg gtg gca gct ccc aga tgg gtc ctg tcc cag gtg cag 103
Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Val Gln
-10 -5 1

ctg cag gag tcc ggt tca gga ccg gtg gat sct tsa cag acc ctg tsc 151
Leu Gln Glu Ser Gly Ser Gly Pro Val Asp Xaa Xaa Gln Thr Leu Xaa
5 10 15

ctc acc tgc act gks tct ggt gtc tcc atc agc agt agt gat aat tgt 199
Leu Thr Cys Thr Xaa Ser Gly Val Ser Ile Ser Ser Ser Asp Asn Cys
20 25 30 35

tgg agc tgg atc cgg cag cca cca ggg aag ggc ctg gag tgg att gga 247
Trp Ser Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly
40 45 50

tac atc tat cay agt ggg ggg acc tac tac aac ccg acc ctc aag agc 295
Tyr Ile Tyr His Ser Gly Gly Thr Tyr Tyr Asn Pro Thr Leu Lys Ser
55 60 65

cga gtc acc atc tcg gba gac agg atc agg aac caa ttc tcc ctg aag 343
Arg Val Thr Ile Ser Xaa Asp Arg Ile Arg Asn Gln Phe Ser Leu Lys
70 75 80

ctg agc tct gtg acg gcc gyg gac acg gcc gtg tat kac tgt ggc aga 391
Leu Ser Ser Val Thr Ala Xaa Asp Thr Ala Val Tyr Xaa Cys Gly Arg
85 90 95

gca cag ggt aga atg ggg atc ggg acg acg att ttt gat ctc tgg ggc 439
Ala Gln Gly Arg Met Gly Ile Gly Thr Thr Ile Phe Asp Leu Trp Gly
100 105 110 115

ggg gga caa tgg tca ccg tct ctg cag cct 469
Gly Gly Gln Trp Ser Pro Ser Leu Gln Pro
120 125

<210> 59
<211> 471
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 52..471

<221> sig_peptide
<222> 52..108
<223> Von Heijne matrix
score 10.8000001907349
seq ILFLVAAATGAHS/QV

<221> misc_feature
<222> 210
<223> n=a, g, c or t

<400> 59
acccaacaac cacatccctc ctcagmagcc cccagagcac aackcctyac c atg gac 57
Met Asp

tgg acc [illegible] agg atc ctc ttt ttg [illegible] gca gca gcc aca [illegible] gcc cac 105
Trp Thr [illegible] Arg Ile Leu Phe Leu Val Ala Ala Ala Thr Gly Ala His
-15 -10 -5

tcc cag gtc cag ctt gtg cag tct ggg gct [illegible] [illegible] aag aag cct [illegible] 153
Ser Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly
1 5 10 15

gcc tca [illegible] aag [illegible] tcc tgc aag gct tct [illegible] tac ayc ttc act arv 201
Ala Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Xaa Phe Thr Xaa
20 25 30

tmt gct [illegible] cat tgg gtg cgc cag gcc ccc gga [illegible] agr ctt [illegible] [illegible] 249
Xaa Ala Xaa His Trp Val Arg Gln Ala Pro Gly Gln Arg Leu Glu [illegible]
35 40 45

atg ggr tgg atc aac gct gcc amt ggt wam aca awa tat tca cag aas 297
Met Gly Trp Ile Asn Ala Ala Xaa Gly Xaa Thr Xaa Tyr Ser Gln Xaa
50 55 60

ttc cag grc aga gtc acc wtt acc agg gac aca tcc gcg agc aca gtc 345
Phe Gln Xaa Arg Val Thr Xaa Thr Arg Asp Thr Ser Ala Ser Thr Val
65 70 75

tcc atg gag ctg agc agc ctg aga tct gaa gac acg gct gtg tat ttc 393
Ser Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Phe
80 85 90 95

tgt gcg aga gat tgg gaa att gca gta gta cca act gct ata aac tct 441
Cys Ala Arg Asp Trp Glu Ile Ala Val Val Pro Thr Ala Ile Asn Ser
100 105 110

tac ggg ttc gac cct ggg gcc agg gaa cct 471
Tyr Gly Phe Asp Pro Gly Ala Arg Glu Pro
115 120

<210> 60
<211> 348
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 193..348

<221> sig_peptide
<222> 193..270
<223> Von Heijne matrix
score 10.8000001907349
seq VLFLCVFLGMSWA/GA

<400> 60
agagcaaaga ggcaatctga agagaaaagc ataggaaagg aaacagtggt aataggaatt 60
ggggtaaaat gaggatcctt ccccacaaac attgctatta ttcagctcat ttcaaaggat 120
tccgstgcwg ccatttgtga gagccgctgg aggctgagtg aaagtcattt tgaaagactg 180
atccaaagaa ga atg gag gcc aga gtg gag cgt gct gtg cag aaa agg caa 231
Met Glu Ala Arg Val Glu Arg Ala Val Gln Lys Arg Gln
-25 -20 -15

gtc tta ttt ctt tgt gta ttt ctg gga atg tct tgg gct ggc gcc gaa 279
Val Leu Phe Leu Cys Val Phe Leu Gly Met Ser Trp Ala Gly Ala Glu
-10 -5 1

ccg ctt cgg tat ttt gtg gcg gag gaa acc gag aga ggc acc tdk ctt 327
Pro Leu Arg Tyr Phe Val Ala Glu Glu Thr Glu Arg Gly Thr Xaa Leu
5 10 15

acc aac ttg gca aaa gac cta 348
Thr Asn Leu Ala Lys Asp Leu
20 25

<210> 61
<211> 457
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 55..456

<221> sig_peptide
<222> 55..111
<223> Von Heijne matrix
score 10.8000001907349
seq ILFLVAAATGAHS/QV

<400> 61
acccaaaaac cacacccctc cttgggagaa tcccctagat cacagctcct cacc atg 57
Met

gac tgg acc tgg agc atc ctt ttc ttg gtg gca gca gcg aca ggt gcc 105
Asp Trp Thr Trp Ser Ile Leu Phe Leu Val Ala Ala Ala Thr Gly Ala
-15 -10 -5

cac tcc cag gtt cag ctg gtg cag tct gga ggt gag gtg aag aag cct 153
His Ser Gln Val Gln Leu Val Gln Ser Gly Gly Glu Val Lys Lys Pro
1 5 10

ggg gcc tcc gtc aag gtc tcc tgc aag gct tct ggt tac acc ttt acc 201
Gly Ala Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr
15 20 25 30

aga tat gat atc aac tgg gtg cga cag gcc cct gga caa ggg ctt gag 249
Arg Tyr Asp Ile Asn Trp Val Arg Gln Ala Pro Gly Gln Gly Leu Glu
35 40 45

tgg atg gga tgg atc agc gct dcc aat ggt aac aca aat tat gca cag 297
Trp Met Gly Trp Ile Ser Ala Xaa Asn Gly Asn Thr Asn Tyr Ala Gln
50 55 60

daa gtc cag ggc aga gtc acc atg acc aca gac aca tcc acg aga aca 345
Xaa Val Gln Gly Arg Val Thr Met Thr Thr Asp Thr Ser Thr Arg Thr
65 70 75

gcc tac atg gaa ctg agg agc ctg cga tct gac gac acg gcc att tat 393
Ala Tyr Met Glu Leu Arg Ser Leu Arg Ser Asp Asp Thr Ala Ile Tyr
80 85 90

tac tgt gcg cga gag atm bta gtg gba sta tgt gat gga cag ttg ggg 441
Tyr Cys Ala Arg Glu Ile Xaa Val Xaa Xaa Cys Asp Gly Gln Leu Gly
95 100 105 110

cca ggg aac ctg gtc a 457
Pro Gly Asn Leu Val
115

<210> 62
<211> 439
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 18..437

<221> sig_peptide
<222> 18..95
<223> Von Heijne matrix
score 10.8000001907349
seq FLLLVAAPRWVLS/QE

<400> 62
agtgcttyct gasagtc atg gac gtc ctg cac aaa cac atg aaa cac ctg 50
Met Asp Val Leu His Lys His Met Lys His Leu
-25 -20

tgg ttc ttc ctc ctc ctg gtg gca gct ccc aga tgg gtc ctg tcc cag 98
Trp Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln
-15 -10 -5 1

gag cag tta cgg cag tgg ggc gca sga ctg ttg aag cct tcg gag acc 146
Glu Gln Leu Arg Gln Trp Gly Ala Xaa Leu Leu Lys Pro Ser Glu Thr
5 10 15

ctg tcc ctc acc tgt agt gtc tat ggt ggg tcc ttc aat ggt tac tac 194
Leu Ser Leu Thr Cys Ser Val Tyr Gly Gly Ser Phe Asn Gly Tyr Tyr
20 25 30

tgg agc tgg atc cgc cag tcc cca ggg aag ggg ctg gag tgg att ggg 242
Trp Ser Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu Trp Ile Gly
35 40 45

gga atc aat cac agc gga agc acc ctc tcc aac ccg tcc ctc aag agt 290
Gly Ile Asn His Ser Gly Ser Thr Leu Ser Asn Pro Ser Leu Lys Ser
50 55 60 65

cgc gtc gac ctc tca gtt gat gcg tcc aag gac cag gtg tcc ctg agg 338
Arg Val Asp Leu Ser Val Asp Ala Ser Lys Asp Gln Val Ser Leu Arg
70 75 80

ctg aaa ctt gtg acc gcc gcg gac acg gct gtg tac ttc tgc gcg aga 386
Leu Lys Leu Val Thr Ala Ala Asp Thr Ala Val Tyr Phe Cys Ala Arg
85 90 95

ccc cat tac gat atg tcg act gat tct tcg ttt gac ggt ttt gat ctc 434
Pro His Tyr Asp Met Ser Thr Asp Ser Ser Phe Asp Gly Phe Asp Leu
100 105 110

tgg gg 439
Trp

<210> 63
<211> 214
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 82..213

<221> sig_peptide
<222> 82..126
<223> Von Heijne matrix
score 10.6999998092651
seq LLALFFLLRIALA/SQ

<400> 63
accattggtg tgtctgtttt tatgccagta ctgtgatgtt ttggttatat agctttgtaa 60
tatattttga agccagatag t atg atg ctt cta gct ttg ttc ttt ttg ctt 111
Met Met Leu Leu Ala Leu Phe Phe Leu Leu
-15 -10

agg att gct ttg gct agt caa ggt ctt ttg tgg ttc cat aca aat ttt 159
Arg Ile Ala Leu Ala Ser Gln Gly Leu Leu Trp Phe His Thr Asn Phe
-5 1 5 10

aag gtt ttt gtt gtt tcy att tgt gtg aag act atc att ggg att tcg 207
Lys Val Phe Val Val Ser Ile Cys Val Lys Thr Ile Ile Gly Ile Ser
15 20 25

ggg ggc a 214
Gly Gly

<210> 64
<211> 297
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 63..296

<221> sig_peptide
<222> 63..119
<223> Von Heijne matrix
score 10.6999998092651
seq ILFLVAAATGALS/QV

<400> 64
gtgcatcacc cagcaaccac atctgtcctc tagagaatcc cctgagadht ccgttcctca 60
cc atg gac tgg acc tgg agg atc ctc ttc ttg gtg gca gcr gcc aca 107
Met Asp Trp Thr Trp Arg Ile Leu Phe Leu Val Ala Ala Ala Thr
-15 -10 -5

gga gcc ctc tcc cag gtg cag ctg gtr cag tct gga ggt gar gtg aag 155
Gly Ala Leu Ser Gln Val Gln Leu Val Gln Ser Gly Gly Glu Val Lys
1 5 10

aag cct ggg gcc tca gtg agg gtc tcc tgc aag gcc tct gga tac agc 203
Lys Pro Gly Ala Ser Val Arg Val Ser Cys Lys Ala Ser Gly Tyr Ser
15 20 25

ttc atc ggc tat tat gta cac tgg ata cga cag act cct ggg cga sgc 251
Phe Ile Gly Tyr Tyr Val His Trp Ile Arg Gln Thr Pro Gly Arg Xaa
30 35 40

ctt gag tgg atg ggg tgg gtc aac cct crs act ggc gac aac ggg g 297
Leu Glu Trp Met Gly Trp Val Asn Pro Xaa Thr Gly Asp Asn Gly
45 50 55 60

<210> 65
<211> 370
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 237..368

<221> sig_peptide
<222> 237..347
<223> Von Heijne matrix
score 10.6000003814697
seq YLLLVLSLSLSLC/CS

<400> 65
aaaggtacac aattgaaaaa aattgtatcc ttcacaacag atgtgggcag tcaactttta 60
gaccttgtgt ctttagtttg acctgtcctt cagtgagtgt atataaaatt ctaagctaaa 120
acatattttc tgaaattgtg aaggtattgc atgtctatct tcttgcctac tctaaatata 180
tcaatcgttt tcttgggaag ttagtctttc tttcacactt gtctgtagat ctttac atg 239
Met

ttc ttt cag ttt tgg aag tcc tct gca tat tta ata ttt gtt agt att 287
Phe Phe Gln Phe Trp Lys Ser Ser Ala Tyr Leu Ile Phe Val Ser Ile
-35 -30 -25

tgt aaa ggt ttt ctt cct gtc tac ctc ctt ctt gtt ctc tct ctc tct 335
Cys Lys Gly Phe Leu Pro Val Tyr Leu Leu Leu Val Leu Ser Leu Ser
-20 -15 -10 -5

ctc tct ctc tgt tgc tct ctc ttg ctc tct ctc ca 370
Leu Ser Leu Cys Cys Ser Leu Leu Leu Ser Leu
1 5

<210> 66
<211> 428
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 45..428

<221> sig_peptide
<222> 45..101
<223> Von Heijne matrix
score 10.6000003814697
seq ILFLVAAATGVHS/QV

<221> misc_feature
<222> 342..343
<223> n=a, g, c or t

<400> 66
aaccacmycc ctcctcagaa gcccccagag cacaacgcct cacc atg gac tgg acc 56
Met Asp Trp Thr

tgg agg atc ctc ttt ttg gtg gca gca gcc aca ggt gtc cac tcc cag 104
Trp Arg Ile Leu Phe Leu Val Ala Ala Ala Thr Gly Val His Ser Gln
-15 -10 -5 1

gtc cac ctt gtt cag tct ggg gct gar gtg aag aag cct ggg act ccg 152
Val His Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Thr Pro
5 10 15

gtg aac att tcc tgt aag gct ttt ggc tac acc ttc cct gcc ttt gct 200
Val Asn Ile Ser Cys Lys Ala Phe Gly Tyr Thr Phe Pro Ala Phe Ala
20 25 30

ata cat tgg gtt cgc cag gcc ccc gga caa agt ctt gag tgg atg gga 248
Ile His Trp Val Arg Gln Ala Pro Gly Gln Ser Leu Glu Trp Met Gly
35 40 45

tgg gtc aac att ggc cat ggc aac aca aag tat tca cag aag ttt cag 296
Trp Val Asn Ile Gly His Gly Asn Thr Lys Tyr Ser Gln Lys Phe Gln
50 55 60 65

ggc aga ctc gcc atc tcc aga gac acg tcc gcg aac ata gtc tac nng 344
Gly Arg Leu Ala Ile Ser Arg Asp Thr Ser Ala Asn Ile Val Tyr Xaa
70 75 80

gaa ctg agc ggc ctg aga tct gaa gac acg gct gtc tat tac tgt gcg 392
Glu Leu Ser Gly Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95

agg gat aat ctt ttc ttt ggc agt atg ggc ttt gac 428
Arg Asp Asn Leu Phe Phe Gly Ser Met Gly Phe Asp
100 105

<210> 67
<211> 493
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 38..493

<221> sig_peptide
<222> 38..85
<223> Von Heijne matrix
score 10.6000003814697
seq TVLLLGLLSHCTG/SV

<400> 67
ctgggcctaa ggaagcagca ctggtggtgc ctcagcc atg gcc tgg acc gtt ctc 55
Met Ala Trp Thr Val Leu
-15

ctc ctc ggc ctc ctc tct cac tgc aca ggc tct gtg acc tcc tat gtg 103
Leu Leu Gly Leu Leu Ser His Cys Thr Gly Ser Val Thr Ser Tyr Val
-10 -5 1 5

cta act cag cct ccc tcg gtg tca gtg gcc cca gga aag acg gcc agc 151
Leu Thr Gln Pro Pro Ser Val Ser Val Ala Pro Gly Lys Thr Ala Ser
10 15 20

att acc tgt ggg gga gac aac att gaa agt caa gtt gta cac tgg cac 199
Ile Thr Cys Gly Gly Asp Asn Ile Glu Ser Gln Val Val His Trp His
25 30 35

cag cag aag cca ggg cag gcc cct ata ttg gtc atc tat gat gat acc 247
Gln Gln Lys Pro Gly Gln Ala Pro Ile Leu Val Ile Tyr Asp Asp Thr
40 45 50

gac [illegible] ccc tca ggg atc cct gac cga ttc tct ggc tcc aac tct ggg 295
Asp Arg Pro Ser Gly Ile Pro Asp Arg Phe Ser Gly Ser Asn Ser Gly
55 60 65 70

cac acg gcc acc ctg acc atc agc agg gtc gaa gcc ggg gat gag gcc 343
His Thr Ala Thr Leu Thr Ile Ser Arg Val Glu Ala Gly Asp Glu Ala
75 80 85

gac tat tat tgt cag gtg tgg gat aga agt agt ggt cag gga ata ttc 391
Asp Tyr Tyr Cys Gln Val Trp Asp Arg Ser Ser Gly Gln Gly Ile Phe
90 95 100

ggc gga ggg acc aag ctg acc gtc cta cgt cag ccc aag gct gcc ccc 439
Gly Gly Gly Thr Lys Leu Thr Val Leu Arg Gln Pro Lys Ala Ala Pro
105 110 115

tcg gtc act ctg ttc ccg ccc tcc tct gag gag ctt caa gcc aac aag 487
Ser Val Thr Leu Phe Pro Pro Ser Ser Glu Glu Leu Gln Ala Asn Lys
120 125 130

gcc aca 493
Ala Thr
135

<210> 68
<211> 180
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 36..179

<221> sig_peptide
<222> 36..80
<223> Von Heijne matrix
score 10.6000003814697
seq LLFLLLFVCFSRQ/GL

<400> 68
tggcagttac tccagctccc aaatatagat attcc atg agg ttg ttg ttt ttg 53
Met Arg Leu Leu Phe Leu
-15 -10

ttg ttg ttt gtt tgt ttt tcg aga cag ggt ctc gct ttg tct ctc agg 101
Leu Leu Phe Val Cys Phe Ser Arg Gln Gly Leu Ala Leu Ser Leu Arg
-5 1 5

ctg gaa tgc agt ggt atg atc atg gct tac tgc agc atc agc ctc cca 149
Leu Glu Cys Ser Gly Met Ile Met Ala Tyr Cys Ser Ile Ser Leu Pro
10 15 20

ggc tca agc agt cct ctc acc tca gcc tcc a 180
Gly Ser Ser Ser Pro Leu Thr Ser Ala Ser
25 30

<210> 69
<211> 259
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 38..259

<221> sig_peptide
<222> 38..94
<223> Von Heijne matrix
score 10.6000003814697
seq FLLLVSAPRWVLS/QV

<400> 69
atacttyctg agagtcctgg acctcctgca caagaac atg aaa cac ctg tgg ttc 55
Met Lys His Leu Trp Phe
-15

ttc ctc ctc ctg gtg tca gct ccc aga tgg gtc ctg tct cag gtg cag 103
Phe Leu Leu Leu Val Ser Ala Pro Arg Trp Val Leu Ser Gln Val Gln
-10 -5 1

cta cag gag tcg ggc cca gga ctg gtg aag cct tcg ggg agg ctg tcc 151
Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Gly Arg Leu Ser
5 10 15

ctc gcc tgc gat gtg gtg gaa ttg agt ccg ccg gcc ccc agg ggc ggg 199
Leu Ala Cys Asp Val Val Glu Leu Ser Pro Pro Ala Pro Arg Gly Gly
20 25 30 35

tct gca gtg cat ctc aga aat ctt tca tca tgg gag ccc cac cta caa 247
Ser Ala Val His Leu Arg Asn Leu Ser Ser Trp Glu Pro His Leu Gln
40 45 50

ccc gtc tcg ggg 259
Pro Val Ser Gly
55

<210> 70
<211> 178
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 7..177

<221> sig_peptide
<222> 7..102
<223> Von Heijne matrix
score 10.6000003814697
seq VVFLLLLVSTLSS/VV

<400> 70
cgtata atg act tac ttt cct ctg ggt aga tac cca gta atg gga ttg 48
Met Thr Tyr Phe Pro Leu Gly Arg Tyr Pro Val Met Gly Leu
-30 -25 -20

ctg gat caa atg gta gtt gtg ttt tta ctt ctt tta gtc tcc aca ctt 96
Leu Asp Gln Met Val Val Val Phe Leu Leu Leu Leu Val Ser Thr Leu
-15 -10 -5

tct tcc gta gtg gtt tta cta gtt tgc att ccc acc agc agt gta aaa 144
Ser Ser Val Val Val Leu Leu Val Cys Ile Pro Thr Ser Ser Val Lys
1 5 10

ttg ttc cct ttt cac cat atc cac acc aac tgg g 178
Leu Phe Pro Phe His His Ile His Thr Asn Trp
15 20 25

<210> 71
<211> 131
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 40..129

<221> sig_peptide
<222> 40..96
<223> Von Heijne matrix
score 10.5
seq WVLLVAMLRGLQC/QV

<400> 71
agctctggga gacgagccca gctctgcagt ggactcacc atg gag ttt ggg ctg 54
Met Glu Phe Gly Leu
-15

agc tgg gtt ctc ctc gtt gct atg tta aga ggt ctc cag tgt caa gtg 102
Ser Trp Val Leu Leu Val Ala Met Leu Arg Gly Leu Gln Cys Gln Val
-10 -5 1

cag ctg gtg gag tct ggg gga acc gcg gg 131
Gln Leu Val Glu Ser Gly Gly Thr Ala
5 10

<210> 72
<211> 217
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 47..217

<221> sig_peptide
<222> 47..91
<223> Von Heijne matrix
score 10.5
seq LSLLILLLENVSG/FP

<400> 72
ttgcttacaa ttttaatgtg tctcattgct actggtcctc cttcta atg tat ctg 55
Met Tyr Leu
-15

agc ttg tta att cta ctt ttg gaa aat gtc agt ggc ttt ccc ttt cct 103
Ser Leu Leu Ile Leu Leu Leu Glu Asn Val Ser Gly Phe Pro Phe Pro
-10 -5 1

cta att ttc cag ctt cat gca tcc cct ggc cat aag ata ctt cca gac 151
Leu Ile Phe Gln Leu His Ala Ser Pro Gly His Lys Ile Leu Pro Asp
5 10 15 20

tgt atg ata tat tct atc act gtc agc ctt atg ttc cct gtg gtt gac 199
Cys Met Ile Tyr Ser Ile Thr Val Ser Leu Met Phe Pro Val Val Asp
25 30 35

tat ata agc acg caa ggg 217
Tyr Ile Ser Thr Gln Gly
40

<210> 73 
<211> 192 
<212> DNA 



WO 99/53051 PCT/IB99/00712

<213> Homo sapiens 



<220>
<221> CDS
<222> 100..192

<221> sig_peptide
<222> 100..183
<223> Von Heijne matrix
score 10.5
seq SLWFXCLLFLLFA/WP

<400> 73
agttaaaatc atgtactgtg atcagtcacc tggtttttga tttttatgaa ggtttttttt 60
gtttagatag ttgttaaatt tggtgttcct gtggggagg atg atg aga gcc ttc 114
Met Met Arg Ala Phe
-25

tat ttg gct atc ttg ttc tgc ctc tct ctc tcc tta tgg ttc tdk tgt 162
Tyr Leu Ala Ile Leu Phe Cys Leu Ser Leu Ser Leu Trp Phe Xaa Cys
-20 -15 -10

tta ctt ttt ttg ctt ttt gct tgg cct ggg 192
Leu Leu Phe Leu Leu Phe Ala Trp Pro Gly
-5 1

<210> 74
<211> 329
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 23..328

<221> sig_peptide
<222> 23..82
<223> Von Heijne matrix
score 10.3999996185303
seq FLTLLLHCTGSLA/QL

<400> 74
agagctctgg ggagtctgca cc atg gct tgg acc cca ctc ctc ttc ctc acc 52
Met Ala Trp Thr Pro Leu Leu Phe Leu Thr
-20 -15

ctc ctc ctc cac tgc aca ggg tct ctc gcc cag ctt gtg ctg act caa 100
Leu Leu Leu His Cys Thr Gly Ser Leu Ala Gln Leu Val Leu Thr Gln
-10 -5 1 5

tcg ccc tct gcc tct gcc tcc ctg gga gcc tcg gtc aag ctc acc tgc 148
Ser Pro Ser Ala Ser Ala Ser Leu Gly Ala Ser Val Lys Leu Thr Cys
10 15 20

act ctg agc agt ggg cac agc aac tac ggc atc gct tgg tat cag cag 196
Thr Leu Ser Ser Gly His Ser Asn Tyr Gly Ile Ala Trp Tyr Gln Gln
25 30 35

cag cca gag aag ggc cct cga ttc ttg atg aaa gtt aac agt gat ggc 244
Gln Pro Glu Lys Gly Pro Arg Phe Leu Met Lys Val Asn Ser Asp Gly
40 45 50

agc cac atg aag gcg gac ggg atc cct gat cgc ttc tca ggc tcc agc 292
Ser His Met Lys Ala Asp Gly Ile Pro Asp Arg Phe Ser Gly Ser Ser
55 60 65 70

tct ggg gct gag cgc tac ctc tcc atc tcc agc ctc a 329
Ser Gly Ala Glu Arg Tyr Leu Ser Ile Ser Ser Leu
75 80

<210> 75
<211> 314
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 259..312

<221> sig_peptide
<222> 259..300
<223> Von Heijne matrix
score 10.3999996185303
seq PLALFFLLSVALA/IQ

<400> 75
tagggtgaga gatggggatc tagttttatt cttctgcata tggatatcca gttttcccag 60
taacatttat tgaagagact ggcctttccc caatgagtgt tcttggcacc tttgtcaaaa 120
gtcagttggc cgtagatatg tggattaatt tctgtgttcc ctgttttgtt ccattggcct 180
atgtgtctgt ttttatgaca gtaccaggtt gttttggtta ctacagcttt gtagtttact 240
ttgaggtctg ttagtgtg atg cct cta gct ttg ttc ttt ttg ctc agt gtt 291
Met Pro Leu Ala Leu Phe Phe Leu Leu Ser Val
-10 -5

gct ttg gct att cag ggt cag gg 314
Ala Leu Ala Ile Gln Gly Gln
1

<210> 76
<211> 447
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 59..445

<221> sig_peptide
<222> 59..115
<223> Von Heijne matrix
score 10.3999996185303
seq XFCLLAVAPGAHS/QV

<400> 76
atcatccaac aaccacatcc cttctctaca gaagcctctg agaggaaagt tcttcacc 58
atg gac tgg acc tgg agg rwc ttc tgc ttg ctg gct gta gct cca ggt 106
Met Asp Trp Thr Trp Arg Xaa Phe Cys Leu Leu Ala Val Ala Pro Gly
-15 -10 -5

gct cac tcc cag gtg cag ctg gtg cag tct ggg gct gag gtg aag aag 154
Ala His Ser Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys
1 5 10

cct ggg gcc tca gtg aag gtt tcc tgc aag gca tct gga tac acc ttc 202
Pro Gly Ala Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe
15 20 25

acc agc cac tat atg cac tgg gtg cga cag gcc cct gga caa ggg ctt 250
Thr Ser His Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln Gly Leu
30 35 40 45

gag tgg atg gga ata atc tac cct gat agt gat acc act aag tac cba 298
Glu Trp Met Gly Ile Ile Tyr Pro Asp Ser Asp Thr Thr Lys Tyr Xaa
50 55 60

cag aac ttc cag ggc aga gtc acc atg act agg gac acg tcc acg agc 346
Gln Asn Phe Gln Gly Arg Val Thr Met Thr Arg Asp Thr Ser Thr Ser
65 70 75

aca gtc tac atg gag ctg agc agc ctg aca tct gac gac acg gcc gtg 394
Thr Val Tyr Met Glu Leu Ser Ser Leu Thr Ser Asp Asp Thr Ala Val
80 85 90

tat tat tgt gct aga gag gcg tat agt ggg agc tac cgc ttt gac tac 442
Tyr Tyr Cys Ala Arg Glu Ala Tyr Ser Gly Ser Tyr Arg Phe Asp Tyr
95 100 105

tgg gg 447
Trp
110

<210> 77
<211> 388
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 16..387

<221> sig_peptide
<222> 16..93
<223> Von Heijne matrix
score 10.3000001907349
seq LLLLVAAPRWVLS/QL

<400> 77
agctttctga gagtc atg gat ctc atg tgc aag aaa atg aga cac ctg tgg 51
Met Asp Leu Met Cys Lys Lys Met Arg His Leu Trp
-25 -20 -15

ttc ctc ctc ctg ctg gtg gcg gct ccc aga tgg gtc ctg tcc cag ctg 99
Phe Leu Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Leu
-10 -5 1

cag ctt cag gag tcg ggc cca gga ctg gtg aag gct tcg gag acc ctg 147
Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Ala Ser Glu Thr Leu
5 10 15

tcc ctc gcc tgc agt gtc tct ggt gac tcc atc agc agt ggt aat tat 195
Ser Leu Ala Cys Ser Val Ser Gly Asp Ser Ile Ser Ser Gly Asn Tyr
20 25 30

tac tgg ggc tgg atc cgg cag ccc cca ggg aag gga ctg cag tgg ctt 243
Tyr Trp Gly Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Gln Trp Leu
35 40 45 50

ggg agt ctt tgg aat cgt ggc ggt ccg caa tac aay hcc tcc ctc aag 291
Gly Ser Leu Trp Asn Arg Gly Gly Pro Gln Tyr Asn Xaa Ser Leu Lys
55 60 65

aat cga gtc acc gtg tcc gta gac acg tcc acg aat cat ttc ttt ctg 339
Asn Arg Val Thr Val Ser Val Asp Thr Ser Thr Asn His Phe Phe Leu
70 75 80

aga ctg aat tcc gtg aay vgh gga cac ggc aat tta tta ctg tgc gcg a 388
Arg Leu Asn Ser Val Asn Xaa Gly His Gly Asn Leu Leu Leu Cys Ala
85 90 95

<210> 78
<211> 121
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 25..120

<221> sig_peptide
<222> 25..72
<223> Von Heijne matrix
score 10.1999998092651
seq XLXLSVLLGXXXX/KX

<400> 78
ctgaccttcc ctcccggcag cagc atg cgg grg tkg ctg kys ttg agt gtc 51
Met Arg Xaa Xaa Leu Xaa Leu Ser Val
-15 -10

ctg ttr ggg sck rts tbt kgc aag gmg gac ttt gtg ggg cat cag gtg 99
Leu Leu Gly Xaa Xaa Xaa Xaa Lys Xaa Asp Phe Val Gly His Gln Val
-5 1 5

ctc cga atc tct gta gcc gat g 121
Leu Arg Ile Ser Val Ala Asp
10 15

<210> 79
<211> 149
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 36..149

<221> sig_peptide
<222> 36..143
<223> Von Heijne matrix
score 10.1999998092651
seq FLLFFCFVFCLRG/QG

<400> 79
tccgagcagc cagctctgtg taagcacatc cagga atg gca gaa tcc agg gag 53
Met Ala Glu Ser Arg Glu
-35

gaa ggt gaa agc tgt gtt gag agc cac tgt gtg ctc ttt ttc acc ctg 101
Glu Gly Glu Ser Cys Val Glu Ser His Cys Val Leu Phe Phe Thr Leu
-30 -25 -20 -15

ttt ttt ttg ttg ttt ttt tgt ttt gtt ttt tgt ttg agg gga cag ggg 149
Phe Phe Leu Leu Phe Phe Cys Phe Val Phe Cys Leu Arg Gly Gln Gly
-10 -5 1

<210> 80
<211> 410
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..409

<221> sig_peptide
<222> 80..136
<223> Von Heijne matrix
score 10.1000003814697
seq WLFLVAFLKGVQC/EV

<400> 80
agctctgaga gaggagccca gccctgggat tttcaggtgt tttcatgtgg tgatcaggac 60
tgaacagaga gaactcacc atg gag ctt ggg ctg agc tgg ctt ttt ctt gtg 112
Met Glu Leu Gly Leu Ser Trp Leu Phe Leu Val
-15 -10

gct ttt tta aaa ggt gtc cag tgt gag gtg cag ttg ttg gag tct ggg 160
Ala Phe Leu Lys Gly Val Gln Cys Glu Val Gln Leu Leu Glu Ser Gly
-5 1 5

gga ggc ttg gtc cag cct ggg ggg tcc ctg aga ctc tca tgt gca gcc 208
Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala
10 15 20

tcc gga ttc acc ttt agc tcc tat gcc atg ctc tgg gtc cgc cag gct 256
Ser Gly Phe Thr Phe Ser Ser Tyr Ala Met Leu Trp Val Arg Gln Ala
25 30 35 40

cca ggt aag ggg ctg gag tgg gtc tca ggt att agt gct ggt gct gat 304
Pro Gly Lys Gly Leu Glu Trp Val Ser Gly Ile Ser Ala Gly Ala Asp
45 50 55

gat aca tat gat gca gac tcc gtg aag ggc cgg ttc acc att tcc aga 352
Asp Thr Tyr Asp Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg
60 65 70

gac gat tcc aag aaa atc cta tat cta caa atg aac agc ctg aga gcc 400
Asp Asp Ser Lys Lys Ile Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala
75 80 85

gag gac agg c 410
Glu Asp Arg
90

<210> 81 

<211> 219 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 38..217

<221> sig_peptide
<222> 38..106
<223> Von Heijne matrix
      score 10.1000003814697
      seq VLGLLVLFLTCYA/DD

<400> 81
gaacaattta tctgcacgaa taccccgtgc taccaga atg gct gtc tca gta ctt    55
                                         Met Ala Val Ser Val Leu
                                                     -20

cgc ctg aca gtt gtc ctg gga ctg ctt gtc tta ttc ctg acc tgc tat    103
Arg Leu Thr Val Val Leu Gly Leu Leu Val Leu Phe Leu Thr Cys Tyr
        -15                 -10                 -5

gca gac gac aaa cca gac aag cca gac gac aag cca gac gac tcg ggc    151
Ala Asp Asp Lys Pro Asp Lys Pro Asp Asp Lys Pro Asp Asp Ser Gly
    1               5                   10                  15

aaa gac cca aag cca gac ttc ccc aaa ttc cta agc ctc ctg ggc aca    199
Lys Asp Pro Lys Pro Asp Phe Pro Lys Phe Leu Ser Leu Leu Gly Thr
                20                  25                  30

gag atc att gag aat gcg gg    219
Glu Ile Ile Glu Asn Ala
            35



<210> 82 

<211> 399 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 81..398

<221> sig_peptide
<222> 81..152
<223> Von Heijne matrix
      score 10
      seq LLLLQALPSPLSA/RA

<400> 82
gaagaagagg gtagaggagg agagggagga ggaggaggga ggtggcggcg ccgtggcgga    60
ggagcaggag caggaggggg atg gag agg aga agg ctc ctg ggt ggc atg gcg    113
                      Met Glu Arg Arg Arg Leu Leu Gly Gly Met Ala
                                      -20                 -15

ctc ctg ctc ctc cag gcg ctg ccc agc ccc ttg tca gcc agg gct gaa    161
Leu Leu Leu Leu Gln Ala Leu Pro Ser Pro Leu Ser Ala Arg Ala Glu
            -10                 -5                  1

ccc ccg cag gat aag gaa gcc tgt gtg ggt acc aac aat caa agc tac    209
Pro Pro Gln Asp Lys Glu Ala Cys Val Gly Thr Asn Asn Gln Ser Tyr
    5                   10                  15

atc tgt gac aca gga cac tgc tgt gga cag tct cag tgc tgy aac tac    257
Ile Cys Asp Thr Gly His Cys Cys Gly Gln Ser Gln Cys Cys Asn Tyr
20                  25                  30                  35

tac tat gaa ctc tgg tgg ttc tgg ctg gtg tgg acc atc atc atc atc    305
Tyr Tyr Glu Leu Trp Trp Phe Trp Leu Val Trp Thr Ile Ile Ile Ile
                40                  45                  50

ctg agc tgc tgc tgt gtt tgc cac cac cgc cga gcc aag cac cgc ctt    353
Leu Ser Cys Cys Cys Val Cys His His Arg Arg Ala Lys His Arg Leu
            55                  60                  65

cag gcc cag cag cgg caa cat gaa atc aac ctg atc gct tac cga g    399
Gln Ala Gln Gln Arg Gln His Glu Ile Asn Leu Ile Ala Tyr Arg
        70                  75                  80











<210> 83 
<211> 398 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 288..398

<221> sig_peptide
<222> 288..368
<223> Von Heijne matrix
      score 9.89999961853027
      seq LCLLLFSLSLFLC/HE

<400> 83
cactctacct ctgacagcat gtatattgca ccagtagcta acaaaaactg gtctagtcaa    60
accaaatggg cacaaaagaa ccaggatacc aaaagttaag ctcatacagc tgcaaaccat    120
atcacttctt ggtaacaatg caagacctca taaacctaaa gaagagaaag aaaagaaaac    180
ttttgttact ttvctttttt gcttgtcact tatatacagg ctatgtgaga atataatttg    240
taggtataac acattaagaa aaagttatct tcattggata gaattga atg gtg gtc    296
                                                    Met Val Val
                                                            -25

gct gat agg aat agg gcg tcc tct agc tct tat ctc tgt ctc tta ctc    344
Ala Asp Arg Asn Arg Ala Ser Ser Ser Ser Tyr Leu Cys Leu Leu Leu
                -20                 -15                 -10

ttt tct ctt tct ctt ttt ctc tgt cat gag act gtg tgt gac agg gcc    392
Phe Ser Leu Ser Leu Phe Leu Cys His Glu Thr Val Cys Asp Arg Ala
            -5                  1               5

acc tgt    398
Thr Cys
    10



<210> 84 
<211> 488 
<212> DNA 

<213> Homo sapiens 



WO 99/53051

<220>
<221> CDS
<222> 62..487

<221> sig_peptide
<222> 62..118
<223> Von Heijne matrix
      score 9.89999961853027
      seq FLFVVAAATGVQS/QV

<221> misc_feature
<222> 210,293
<223> n=a, g, c or t

<400> 84
agcatcacat aacaaccaca ttcctcctct aaagaagccc ctgggagcac agctcatcac    60
c atg gac tgg acc tgg agg ttc ctc ttt gtg gtg gca gca gct aca ggt    109
  Met Asp Trp Thr Trp Arg Phe Leu Phe Val Val Ala Ala Ala Thr Gly
                  -15                 -10                 -5

gtc cag tcc cag gtg cag ctg gtg cag tct ggg gct gag gtg aag aag    157
Val Gln Ser Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys
            1               5                   10

cct ggg tcc tcg gtg aag gtc tcc tgc aag gct tct gga ggc acc ttc    205
Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe
    15                  20                  25

agc anc tat gct atc agc tgg gtg cga cag gcc cct gga caa ggg ctt    253
Ser Xaa Tyr Ala Ile Ser Trp Val Arg Gln Ala Pro Gly Gln Gly Leu
30                  35                  40                  45

gag tgg atg gga ggg atc atc cct atc ttt ggt aca gca nac tac gca    301
Glu Trp Met Gly Gly Ile Ile Pro Ile Phe Gly Thr Ala Xaa Tyr Ala
                50                  55                  60

cag aag ttc cag ggc aga gtc acs att acc gcg gac gra tcc acg asc    349
Gln Lys Phe Gln Gly Arg Val Thr Ile Thr Ala Asp Xaa Ser Thr Xaa
            65                  70                  75

aca rcc tac atg gag ctg agc agc ctg aga tct gag gac acg gcc stg    397
Thr Xaa Tyr Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Xaa
        80                  85                  90

tat tac tgt gcg aga ggt caa gcc ccc ggt agg gta gta gta cca ctt    445
Tyr Tyr Cys Ala Arg Gly Gln Ala Pro Gly Arg Val Val Val Pro Leu
    95                  100                 105

ttc ctc tgg ggc cag gga acc tgg tca ccg tct cct cag cct c    488
Phe Leu Trp Gly Gln Gly Thr Trp Ser Pro Ser Pro Gln Pro
110                 115                 120















<210> 85 

<211> 290 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 30..290

<221> sig_peptide
<222> 30..164
<223> Von Heijne matrix
      score 9.89999961853027
      seq LLSLLSFLDETSG/LS

<400> 85
cttctttttc ttcgtaactt catggcaac atg acc tac agt tac tca ttt ttc    53
                                Met Thr Tyr Ser Tyr Ser Phe Phe
                                -45                 -40



PCT/IB99/00712







agg 


cct 




ucg 




gcc 






ctt 


aat 


tat 




cat 




gaa 




Arg 


Pro 


Glu 


Leu 


lie 


Val 


Asn 


His 
-30 


Leu 


Asn 


Tvr 


Val 


His 
-25 


Ser 


Glu 


Ala 


aac agg aga acc aag acc aaa act tta tta tct ctg ctt tca ttt ctt    149
Asn Arg Arg Thr Lys Thr Lys Thr Leu Leu Ser Leu Leu Ser Phe Leu
    -20                 -15                 -10


gat 


gaa 


acc 


ecu 


gga 


cca 


age 


aca 




U L. 


cct 


tgt 


tta 


UC t 


etc 


uca 


Asp Glu 


Thr 


Ser 


Gly 


Leu 


Ser 


Thr 


His 


Leu 


Pro 


Cys 


Leu 


Ser 


Leu 


Ser 


-5 










1 








5 










10 




aag gag tgt gga gtg ctt cat ctg gac atc cac ggg aag aag gaa gac    245
Lys Glu Cys Gly Val Leu His Leu Asp Ile His Gly Lys Lys Glu Asp
            15                  20                  25






atg aga gat gag gtc ttg ctg gcc ttg aac tyc tgc acc cac agg    290
Met Arg Asp Glu Val Leu Leu Ala Leu Asn Xaa Cys Thr His Arg
        30                  35                  40





<210> 86 

<211> 336 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 100..336

<221> sig_peptide
<222> 100..156
<223> Von Heijne matrix
      score 9.89999961853027
      seq ILFLVFLLAGLRS/KA

<400> 86
ccagatctgt tctgcaacat tcaccgttct ctgcatccag ctctgcttat ctgctgttac    60
cttggacacc agagcagcta taggtatctg ccagragcw atg aaa tca ttc agc    114
                                           Met Lys Ser Phe Ser
                                                           -15

cgg atc ctc ttc ctc gtc ttc ctc ctc gcc ggc ctg agg tcc aag gcc    162
Arg Ile Leu Phe Leu Val Phe Leu Leu Ala Gly Leu Arg Ser Lys Ala
                -10                 -5                  1

gct ccc tca gcc cct ctg cct ttg ggc tgt ggc ttt ccg gac atg gcc    210
Ala Pro Ser Ala Pro Leu Pro Leu Gly Cys Gly Phe Pro Asp Met Ala
        5                   10                  15

cac ccc tct gag act tcc cct ctg aag ggt gct tct gaa aat tcc aaa    258
His Pro Ser Glu Thr Ser Pro Leu Lys Gly Ala Ser Glu Asn Ser Lys
    20                  25                  30

cga gat cgc ctt aac cca gaa ttt cct ggg act cct tac cct gag cct    306
Arg Asp Arg Leu Asn Pro Glu Phe Pro Gly Thr Pro Tyr Pro Glu Pro
35                  40                  45                  50

tcc aag cta cct cat acg gtt tcc ctg gaa    336
Ser Lys Leu Pro His Thr Val Ser Leu Glu
                55                  60



<210> 87 
<211> 262 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 108..260

<221> sig_peptide
<222> 108..230
<223> Von Heijne matrix
      score 9.89999961853027
      seq SLCHLGWSAVVQS/QP

<400> 87
taggagtgga gtgactgggt gatatgataa ctctatgttt aactttttaa ggaactgcta    60
gacttttctg aagtgactat gccattttac attaacacca ggagtgt atg agg gtg    116
                                                    Met Arg Val
                                                        -40

ccg att ttt cca cat cct cac caa ctc tcg tta tta ttc atc cat tta    164
Pro Ile Phe Pro His Pro His Gln Leu Ser Leu Leu Phe Ile His Leu
            -35                 -30                 -25

ttt att tat tta ttt aga gaa agg gtc tct ctc tgt cac cta ggc tgg    212
Phe Ile Tyr Leu Phe Arg Glu Arg Val Ser Leu Cys His Leu Gly Trp
        -20                 -15                 -10

agt gca gtg gta caa tca cag cca act aca acc ttg acc tcc cgc gct    260
Ser Ala Val Val Gln Ser Gln Pro Thr Thr Thr Leu Thr Ser Arg Ala
    -5                  1               5                   10

am    262



<210> 88 

<211> 149 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 18..149

<221> sig_peptide
<222> 18..128
<223> Von Heijne matrix
      score 9.89999961853027
      seq FLFVLFCFGGSRA/LL

<400> 88
ttcggagctt gaccagc atg tgg aag gag agc tct cat ggc tgc aat aac    50
                   Met Trp Lys Glu Ser Ser His Gly Cys Asn Asn
                           -35                 -30

tta ggg agt tcc tac ctg gat gac act ggg gta gga agt ttt ctg ttt    98
Leu Gly Ser Ser Tyr Leu Asp Asp Thr Gly Val Gly Ser Phe Leu Phe
    -25                 -20                 -15

gtt ttg ttc tgt ttc gga ggg tcc cgt gca ctt ctc ttg cct gga tct    146
Val Leu Phe Cys Phe Gly Gly Ser Arg Ala Leu Leu Leu Pro Gly Ser
-10                 -5                  1               5

ggg    149
Gly



<210> 89 

<211> 315 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 236..313

<221> sig_peptide
<222> 236..283
<223> Von Heijne matrix
      score 9.69999980926514
      seq FLCLLFYLIVSCG/AV

<400> 89
gtaaaagaca aataacttgt atggtttgca aaatgatctg aatatgtgct tttataacat    60
tcagaataca cccaaaagta aactttaggt ttaatgtaca gtatgttttc tatgtaattg    120
ttttgaataa gtaatamcat ybtacatggc ttaaaactga aaaacgtatt cctgttactt    180
cttgatgctt ttgagaaatg aataatgttt tctccctttt aaatggtagt acagc atg    238
                                                             Met

cac act ttt ctg tgc ttg ctt ttt tat ctc ata gta tct tgt gga gct    286
His Thr Phe Leu Cys Leu Leu Phe Tyr Leu Ile Val Ser Cys Gly Ala
-15                 -10                 -5                  1

gtt ttc tta aca gtc cct tct ccc caa gg    315
Val Phe Leu Thr Val Pro Ser Pro Gln
            5                   10



<210> 90 

<211> 179 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 24..179

<221> sig_peptide
<222> 24..140
<223> Von Heijne matrix
      score 9.69999980926514
      seq SIILXLXFPGILG/QA

<221> misc_feature
<222> 57
<223> n=a, g, c or t

<400> 90
agtnrctctgg ggcagtctgc acc atg gcc tgg cac ccc act cct cct cct ctt    53
                           Met Ala Trp His Pro Thr Pro Pro Pro Leu
                                           -35                 -30

csb ncw cct cct cca ctg mac agg gwc tcy ctc cca gcc tgt gct gac    101
Xaa Xaa Pro Pro Pro Leu Xaa Arg Xaa Ser Leu Pro Ala Cys Ala Asp
                -25                 -20                 -15

tca atc atc ctc tgm ctc tgm ttc cct ggg atc ctc ggw caa gct cac    149
Ser Ile Ile Leu Xaa Leu Xaa Phe Pro Gly Ile Leu Gly Gln Ala His
            -10                 -5                  1

ctg mac tct gag cag tgg aca cag tac cta    179
Leu Xaa Ser Glu Gln Trp Thr Gln Tyr Leu
    5                   10



<210> 91 

<211> 423 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 311..421

<221> sig_peptide
<222> 311..373
<223> Von Heijne matrix
      score 9.69999980926514
      seq LHLLILLSGTCFT/WI

<400> 91
gactctatag srcaaatggt taagaacata tacttgggag tcagttgatc tgggttcaaa    60
ttctagctgt gctactttct acctatgctg tattggacaa atgatactgt gtatctgttt    120
cttcaaccgt aagttgggta tattaatatc cttacctcaa aaggtcatga tgattaagtg    180
agtbaatgca tgtaaaatgc cttctgtgcc gggcagtcag aaaccactca ataaatattg    240
attattctca ccaaagatgt gcttcctgac ctcaaaagcc tgtcagccta atataaagac    300
agtgtgacaa atg cca atc ctg cct cag gac atc ttg cac ttg ctg atc    349
           Met Pro Ile Leu Pro Gln Asp Ile Leu His Leu Leu Ile
               -20                 -15                 -10

ctt ctg tct gga aca tgc ttc act tgg att ctt ttg tgg ctt cca ctc    397
Leu Leu Ser Gly Thr Cys Phe Thr Trp Ile Leu Leu Trp Leu Pro Leu
            -5                  1               5

tcc cct ctg ttg ggc ctg aaa tgc ta    423
Ser Pro Leu Leu Gly Leu Lys Cys
    10                  15

<210> 92 

<211> 316 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 62..316

<221> sig_peptide
<222> 62..121
<223> Von Heijne matrix
      score 9.60000038146973
      seq LLALLLCGRPGRG/QT

<221> misc_feature
<222> 264,266
<223> n=a, g, c or t

<400> 92
accgcagctc cagagccctg cgggaggact cagagtcagg gacacagcag cgtccggcga    60
g atg aag gcg ctc ggg gct gtc ctg ctt gcb ctc ttg ctg tgc ggg cgg    109
  Met Lys Ala Leu Gly Ala Val Leu Leu Ala Leu Leu Leu Cys Gly Arg
  -20                 -15                 -10                 -5

cca ggg aga ggg cag aca cag cag gag gaa gag gaa gag gac gag gac    157
Pro Gly Arg Gly Gln Thr Gln Gln Glu Glu Glu Glu Glu Asp Glu Asp
                1               5                   10

cac ggg cca gat gac tac gac gag gaa gat gag gat gag gtt gaa gag    205
His Gly Pro Asp Asp Tyr Asp Glu Glu Asp Glu Asp Glu Val Glu Glu
        15                  20                  25

gag gag acc aac agg ctc cct ggt ggc agg agc aga gtg ctg ctg cgg    253
Glu Glu Thr Asn Arg Leu Pro Gly Gly Arg Ser Arg Val Leu Leu Arg
    30                  35                  40

tgc tac acc tnk nag tcc ctg ccc agg gac gag cgc tgc aac ctg acg    301
Cys Tyr Thr Xaa Xaa Ser Leu Pro Arg Asp Glu Arg Cys Asn Leu Thr
45                  50                  55                  60

cag aac tgc tca cat    316
Gln Asn Cys Ser His
                65

<210> 93 

<211> 508 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 245..508

<221> sig_peptide
<222> 245..289
<223> Von Heijne matrix
      score 9.60000038146973
      seq EYVLLLFLALCSA/KP

<400> 93
agtactaaca tggactaatc tgtgggagca gtttattcca gtatcaccca gggtgcagcc    60
acaccaggac tgtgttgaag ggtgtttttt ttcttttaaa tgtaatacct cctcatcttt    120
tcttcttaca cagtgtctga gaacatttac attatagata agtagtacat ggtggataac    180
ttctactttt aggaggacta ctctcttctg acagtcctag actggtcttc tacactaaga    240
cacc atg aag gag tat gtg ctc cta tta ttc ctg gct ttg tgc tct gcc    289
     Met Lys Glu Tyr Val Leu Leu Leu Phe Leu Ala Leu Cys Ser Ala
     -15                 -10                 -5

aaa ccc ttc ttt agc cct tca cac atc gca ctg aag aat atg atg ctg    337
Lys Pro Phe Phe Ser Pro Ser His Ile Ala Leu Lys Asn Met Met Leu
1               5                   10                  15

aag gat atg gaa gac aca gat gat gat gat gat gat gat gat gat gat    385
Lys Asp Met Glu Asp Thr Asp Asp Asp Asp Asp Asp Asp Asp Asp Asp
            20                  25                  30

gat gat gat gag gac aac tct ctt ttt cca aca aga gag cca aga agc    433
Asp Asp Asp Glu Asp Asn Ser Leu Phe Pro Thr Arg Glu Pro Arg Ser
        35                  40                  45

cat ttt ttt cca ttt gat ctg ttt cca atg tgt cca ttt gga tgt cag    481
His Phe Phe Pro Phe Asp Leu Phe Pro Met Cys Pro Phe Gly Cys Gln
    50                  55                  60

tgc tat tca cga gtt gta cat tgc tca    508
Cys Tyr Ser Arg Val Val His Cys Ser
65                  70

<210> 94 
<211> 321 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 36..320

<221> sig_peptide
<222> 36..92
<223> Von Heijne matrix
      score 9.60000038146973
      seq FLLLVAAPRWAMS/QV

<400> 94
actttctgag aggcctggac ctcctgcaca agaac atg aaa cac ctg tgg ttc    53
                                       Met Lys His Leu Trp Phe
                                                       -15

ttc ctc ctc ctg gtg gca gct ccc aga tgg gcc atg tct cag gtg caa    101
Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Ala Met Ser Gln Val Gln
            -10                 -5                  1

ctg cag gaa tcg ggc ccg aga ctg gtg aaa cct tcg ggg acc ctg tcc    149
Leu Gln Glu Ser Gly Pro Arg Leu Val Lys Pro Ser Gly Thr Leu Ser
    5                   10                  15

ctc acc tgc agt gtc tct ggt ggc tcc atg gcc act agt gac tgg tgg    197
Leu Thr Cys Ser Val Ser Gly Gly Ser Met Ala Thr Ser Asp Trp Trp
20                  25                  30                  35

agt tgg ttt cga cag acm ccg gag aag ggt ctg gag tgg att ggg gaa    245
Ser Trp Phe Arg Gln Thr Pro Glu Lys Gly Leu Glu Trp Ile Gly Glu
                40                  45                  50

atc ttt cag act ggg ccc acc aat tac aac ccg tcc ctc aag agc cgc    293
Ile Phe Gln Thr Gly Pro Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg
            55                  60                  65

gtc tcc atg tca gtg gac atg tcc aag a    321
Val Ser Met Ser Val Asp Met Ser Lys
        70                  75



<210> 95 
<211> 402 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 15..401

<221> sig_peptide
<222> 15..92
<223> Von Heijne matrix
      score 9.5
      seq FLLLVAAPRWALS/QL

<400> 95
gctttctgag agtc atg gat ctc acg tgc aag aaa atg aag cac ctg tgg    50
                Met Asp Leu Thr Cys Lys Lys Met Lys His Leu Trp
                    -25                 -20                 -15

ttc ttc ctc ctg ctg gtg gcg gct ccc aga tgg gcc ctg tcc caa ctg    98
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Ala Leu Ser Gln Leu
                -10                 -5                  1

cag ctg cag gag tcg ggc cca gga ctg gtg aag cct tcg gag acc ctg    146
Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu Thr Leu
        5                   10                  15

tcc ctc acg tgc act gtc tct ggt gaa tcc atc acc act aat tca ttc    194
Ser Leu Thr Cys Thr Val Ser Gly Glu Ser Ile Thr Thr Asn Ser Phe
    20                  25                  30

tgc tgg gcc tgg atc cgc cag ccc ccg ggg aag ggg ctg gaa tgg ctt    242
Cys Trp Ala Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Leu
35                  40                  45                  50

ggg act gta tgt tat ggt ggg acc acc tac krc aac kcg tcc ctg aag    290
Gly Thr Val Cys Tyr Gly Gly Thr Thr Tyr Xaa Asn Xaa Ser Leu Lys
                55                  60                  65

agt cga gtc aag tta tcg ttg gac acg tcc acg aat cag ttc tcc ctg    338
Ser Arg Val Lys Leu Ser Leu Asp Thr Ser Thr Asn Gln Phe Ser Leu
            70                  75                  80

aag gtc acc tct atg acc gcc gga gac gcg gct gtc cat tac tgt gcg    386
Lys Val Thr Ser Met Thr Ala Gly Asp Ala Ala Val His Tyr Cys Ala
        85                  90                  95

ggg ctg cgt gtt agt g    402
Gly Leu Arg Val Ser
    100



<210> 96 

<211> 315 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 118..315

<221> sig_peptide
<222> 118..306
<223> Von Heijne matrix
      score 9.5
      seq VLLFLILLYMSWS/AS

<400> 96
agagacacac ttggacgrtt cctgcagraa tcagtgaggc agtctcctcc caggggcttg    60
gsgcctggct cgaggcgagg ctgccggccc ggacgctgac tgcccagtgc cacagac    117
atg gcc aac ggg acc aac gcc tct gcc cca tac tac agc tat gaa tac    165
Met Ala Asn Gly Thr Asn Ala Ser Ala Pro Tyr Tyr Ser Tyr Glu Tyr
            -60                 -55                 -50

tac ctg gac tat ctg gac ctc att ccc gtg gac gag aag aag ctg aaa    213
Tyr Leu Asp Tyr Leu Asp Leu Ile Pro Val Asp Glu Lys Lys Leu Lys
        -45                 -40                 -35

gcc cac aaa cat tcc atc gtg atc gca ttc tgg gtg agc ctg gct gcc    261
Ala His Lys His Ser Ile Val Ile Ala Phe Trp Val Ser Leu Ala Ala
    -30                 -25                 -20

ttc gtg gtg ctg ctc ttc ctc atc ttg ctc tac atg tcc tgg tcc gcs    309
Phe Val Val Leu Leu Phe Leu Ile Leu Leu Tyr Met Ser Trp Ser Ala
-15                 -10                 -5                  1

tcc ccg    315
Ser Pro



<210> 97 

<211> 460 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 62..460

<221> sig_peptide
<222> 62..118
<223> Von Heijne matrix
      score 9.39999961853027
      seq FLFVVAAATGVQS/QX

<400> 97
agcatcacat aacaaccasa ttcctcctct aaagaagccc ctgggagcac agctcatcac    60
c atg gac tgg acc tgg agg ttc ctc ttt gtg gtg gca gca gct aca ggt    109
  Met Asp Trp Thr Trp Arg Phe Leu Phe Val Val Ala Ala Ala Thr Gly
                  -15                 -10                 -5

gtc cag tcm cag gks cas ctg gwg cag tct ggg gct gag gtg aag aag    157
Val Gln Ser Gln Xaa Xaa Leu Xaa Gln Ser Gly Ala Glu Val Lys Lys
            1               5                   10

cct ggg tcc tcg gtg aaa gtc tcc tgc arg gcy tct gga ggc atc ytc    205
Pro Gly Ser Ser Val Lys Val Ser Cys Xaa Ala Ser Gly Gly Ile Xaa
    15                  20                  25

agc asc tat agc ttc aac tgg gtg cgm cag gcc cct gga cag ggg ttt    253
Ser Xaa Tyr Ser Phe Asn Trp Val Arg Gln Ala Pro Gly Gln Gly Phe
30                  35                  40                  45

gag tgg ttg gga agg atc atc ccc atc ctc ggt ata aca aac tac gca    301
Glu Trp Leu Gly Arg Ile Ile Pro Ile Leu Gly Ile Thr Asn Tyr Ala
                50                  55                  60

gag aag ttt cgg ggc aga ctc acg atc acc gtg gac aaa tcc acg cgt    349
Glu Lys Phe Arg Gly Arg Leu Thr Ile Thr Val Asp Lys Ser Thr Arg
            65                  70                  75

gtt gtt tac atg gag cag agc agt ctg aca tct gcg gac acg gcc gta    397
Val Val Tyr Met Glu Gln Ser Ser Leu Thr Ser Ala Asp Thr Ala Val
        80                  85                  90

tat tat tgt gcg aaa ccg act atg act tcg gaa cta cgg gtc tac tat    445
Tyr Tyr Cys Ala Lys Pro Thr Met Thr Ser Glu Leu Arg Val Tyr Tyr
    95                  100                 105

cag wct aca cta tgg    460
Gln Xaa Thr Leu Trp
110

<210> 98 

<211> 230 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 140..229

<221> sig_peptide
<222> 140..205
<223> Von Heijne matrix
      score 9.39999961853027
      seq LLLLSAFTSQTVS/GQ

<400> 98
aacagaacaa tatcaaatag ctaacttcac ccccaaccac agtccttgct gttggcattt    60
actcaactag tctttaattc ctgttttgac aaactttata aggtgctaca agacagatga    120
tttttcacca tctaccata atg tgg aac aga tat ttt gtc ttc tat ctc ctg    172
                     Met Trp Asn Arg Tyr Phe Val Phe Tyr Leu Leu
                             -20                 -15

ctt ttg tca gcg ttt acg agt caa aca gta tcc gga caa aga aag aaa    220
Leu Leu Ser Ala Phe Thr Ser Gln Thr Val Ser Gly Gln Arg Lys Lys
    -10                 -5                  1               5

gga ccc cgg g    230
Gly Pro Arg

<210> 99 

<211> 467 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 40..465

<221> sig_peptide
<222> 40..96
<223> Von Heijne matrix
      score 9.39999961853027
      seq FLLLVAAPRWVLS/QL

<400> 99
aaatactttc tgagagccct ggacctcctg tgcaagaac atg aaa cac ctg ggg    54
                                           Met Lys His Leu Gly
                                                           -15

ttc ttc ctc ctg ctg gtg gca gct ccc aga tgg gtc ctg tcc cag ctg    102
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Leu Ser Gln Leu
                -10                 -5                  1

cag ctc cag gag tcc ggc tca gga ctg gag aag cct tca cag acc ctg    150
Gln Leu Gln Glu Ser Gly Ser Gly Leu Glu Lys Pro Ser Gln Thr Leu
        5                   10                  15

tcc ctc acc tgc tct gtc tct ggt ggc tcc atc agt agt gat gat ttg    198
Ser Leu Thr Cys Ser Val Ser Gly Gly Ser Ile Ser Ser Asp Asp Leu
    20                  25                  30

tcg tgg agc tgg atc cga cag ccg cca ggg aag ggc ctg gag tgg att    246
Ser Trp Ser Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile
35                  40                  45                  50

ggc tac att tat caa aat gag agg acc ctc tac aac ccg tcc ctc aag    294
Gly Tyr Ile Tyr Gln Asn Glu Arg Thr Leu Tyr Asn Pro Ser Leu Lys
                55                  60                  65

agt cga gcc gcc att tca gtg gac agg tcc aag aac cag ttc tcc ctg    342
Ser Arg Ala Ala Ile Ser Val Asp Arg Ser Lys Asn Gln Phe Ser Leu
            70                  75                  80

aaa ctg acc tct gtg acc gcc gcg gac atg gcc gta tat tac tgt gcc    390
Lys Leu Thr Ser Val Thr Ala Ala Asp Met Ala Val Tyr Tyr Cys Ala
        85                  90                  95

acc agt gtc atg awt tcc ttt ggg ggc gtt ctc gtc cct aat ctg ttt    438
Thr Ser Val Met Xaa Ser Phe Gly Gly Val Leu Val Pro Asn Leu Phe
    100                 105                 110

ttg act act ggg gcc agg gaa tct cgt ca    467
Leu Thr Thr Gly Ala Arg Glu Ser Arg
115                 120



<210> 100 
<211> 504 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 39..503

<221> sig_peptide
<222> 39..95
<223> Von Heijne matrix
      score 9.30000019073486
      seq FLLLVAGPRWVLS/QV

<400> 100
aatactttct gagagtcctg gacctcctgt gcaagaac atg aaa cac ctg tgg ttc    56
                                          Met Lys His Leu Trp Phe
                                                          -15

ttc ctc ctg ctg gtg gca ggt ccc aga tgg gtc ctg tcc cag gtg cag    104
Phe Leu Leu Leu Val Ala Gly Pro Arg Trp Val Leu Ser Gln Val Gln
            -10                 -5                  1

ctg sdk gag tcg ggc cca aga ctg gtg aag cct tca cag acc ctg tcc    152
Leu Xaa Glu Ser Gly Pro Arg Leu Val Lys Pro Ser Gln Thr Leu Ser
    5                   10                  15

ctc acc tgc act gta tct ggg gcc tcc gtc agc agt cgt ggg tac tat    200
Leu Thr Cys Thr Val Ser Gly Ala Ser Val Ser Ser Arg Gly Tyr Tyr
20                  25                  30                  35

tgg acc tgg atc cgc cag ctc cca ggg aag ggc ctg gag tgg att ggc    248
Trp Thr Trp Ile Arg Gln Leu Pro Gly Lys Gly Leu Glu Trp Ile Gly
                40                  45                  50

tac atc tgk tac act ggg agc acc ttc tac aac ccg tcc ctc aag agt    296
Tyr Ile Xaa Tyr Thr Gly Ser Thr Phe Tyr Asn Pro Ser Leu Lys Ser
            55                  60                  65

cga tta acc ata tca ata gac acg tct aag aat cag ttc tcc ctg aac    344
Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Asn Gln Phe Ser Leu Asn
        70                  75                  80

ctg agg tct gtg act acc gcg gac acg gcc gtc tat tac tgt gcg aga    392
Leu Arg Ser Val Thr Thr Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
    85                  90                  95

gac cat ttc gat ctt cta ttc gac ccc tgg ggc cag gga acc ctg gtc    440
Asp His Phe Asp Leu Leu Phe Asp Pro Trp Gly Gln Gly Thr Leu Val
100                 105                 110                 115

acc gtc tcc tct gcm tcc acc aag ggc cca tcg gtc ttc ccc ctg gca    488
Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala
                120                 125                 130

scc tcc tcc aag agc a    504
Xaa Ser Ser Lys Ser
            135



<210> 101 
<211> 336 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 148..336

<221> sig_peptide
<222> 148..270
<223> Von Heijne matrix
      score 9.30000019073486
      seq VLXLFCFVFEAES/RS

<400> 101
agagctcgcg gtggactccg acccggcgca acatggccgc agcctcgcct ctgcgcgact    60
gccaggcctg gaaggatgcg aggctcccgc tctccaccac aagcaacgaa gcctgcaagc    120
tgttcgatgc cacgctgacc cagggta atg gcc tgc cga gag agg ccg cgg ccc    174
                              Met Ala Cys Arg Glu Arg Pro Arg Pro
                                  -40                 -35

ctt ctg tgg agg tct agg gga agg ttt ttt aat tgg gga aag ctg ttt    222
Leu Leu Trp Arg Ser Arg Gly Arg Phe Phe Asn Trp Gly Lys Leu Phe
        -30                 -25                 -20

ttt tgt ttt gtt ttg mtt ttg ttt tgt ttt gtt ttt gag gcg gag tct    270
Phe Cys Phe Val Leu Xaa Leu Phe Cys Phe Val Phe Glu Ala Glu Ser
    -15                 -10                 -5

cgc tct gtc gcc cag gct gga gtg cag tgg cgc tat ttc ggc tca cta    318
Arg Ser Val Ala Gln Ala Gly Val Gln Trp Arg Tyr Phe Gly Ser Leu
1               5                   10                  15

caa gct ttg cct ccc tgg    336
Gln Ala Leu Pro Pro Trp
            20



<210> 102 
<211> 289 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 214..288

<221> sig_peptide
<222> 214..276
<223> Von Heijne matrix
      score 9.19999980926514
      seq FILFHWSLCLCLC/QY

<400> 102
cccttatgtt ttcttttagt aatttcatag tttcaagttt tatatataag tctttaatcc    60
attttgagtt gatttgtgta tatggtggag acagggtcta gtcttggtct tctgcatgtg    120
actttccaat tttcccagca ccatttattg gagaaactgt ctttttccca gtgcatgttc    180
ttggcacctt tgttgaaaaa cagttggcca tag atg cat gaa ttt att tct ggg    234
                                     Met His Glu Phe Ile Ser Gly
                                         -20                 -15

ttc ttt att ctc ttt cat tgg tct ctg tgt ttg tgt tta tgc caa tac    282
Phe Phe Ile Leu Phe His Trp Ser Leu Cys Leu Cys Leu Cys Gln Tyr
                -10                 -5                  1

cat gcc g    289
His Ala

<210> 103
<211> 383
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 252..383

<221> sig_peptide
<222> 252..377
<223> Von Heijne matrix
      score 9.19999980926514
      seq LLVCLFAVTSILC/SS

<400> 103
atcctccagc taataagtgt ccaagctggg actcaaactt gggcctttta actgtgctgc    60
tattctacct ctcccttgct ctttccagac caggcttggg acataacact aacacccttt    120
tcctttcatt tcatctcttg tccttcagtc attcctaaac attgacaybc attgagttcc    180
ttggctctgg ccatagtcct ttctcccttt cccctctggg gcatcaaata gtgattacag    240
tatccacagg g atg gca tat gcc att tca cca ttt cac agt tcc tgg aat    290
             Met Ala Tyr Ala Ile Ser Pro Phe His Ser Ser Trp Asn
                     -40                 -35                 -30

cca ctt ttc act tct cat aaa gct tca gca agc cat tct cat ctt ggg    338
Pro Leu Phe Thr Ser His Lys Ala Ser Ala Ser His Ser His Leu Gly
                -25                 -20                 -15

ttg ctt gtt tgc cta ttt gct gtt aca tcc att ctc tgc tcc tca    383
Leu Leu Val Cys Leu Phe Ala Val Thr Ser Ile Leu Cys Ser Ser
            -10                 -5                  1

<210> 104 

<211> 211 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 30..209

<221> sig_peptide
<222> 30..74
<223> Von Heijne matrix
      score 9.19999980926514
      seq PVLLLALLGFILP/LP

<221> misc_feature
<222> 83
<223> n=a, g, c or t

<400> 104
agaaagagat taccagccac agacgggtc atg agc ccg gta tta ctg ctg gcc    53
                                Met Ser Pro Val Leu Leu Leu Ala
                                -15                 -10

ctc ctg ggg ttc atc ctc cca ctg cca ggn agt gca rgc gct gss tck    101
Leu Leu Gly Phe Ile Leu Pro Leu Pro Gly Ser Ala Xaa Ala Xaa Ser
        -5                  1               5

gcc agt ttg gga cag ttc agc atg tgt gga agg tgt ccg acm tgc ccc    149
Ala Ser Leu Gly Gln Phe Ser Met Cys Gly Arg Cys Pro Thr Cys Pro
10                  15                  20                  25

ggc aat gga ccc cta aga aca cca gct gcg aca sgg vtt rgg gtg cca    197
Gly Asn Gly Pro Leu Arg Thr Pro Ala Ala Thr Xaa Xaa Xaa Val Pro
                30                  35                  40

gga cac gtt gat gc    211
Gly His Val Asp
            45



<210> 105 
<211> 492 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 29..490

<221> sig_peptide
<222> 29..97
<223> Von Heijne matrix
      score 9.10000038146973
      seq SLLLFSLMCETSA/FY

<400> 105
agtcattacg gcgacacgtg gatccaag atg gcg acg gcg atg gat tgg ttg    52
                               Met Ala Thr Ala Met Asp Trp Leu
                                           -20

ccg tgg tct tta ctg ctt ttc tcc ctg atg tgt gaa aca agc gcc ttc    100
Pro Trp Ser Leu Leu Leu Phe Ser Leu Met Cys Glu Thr Ser Ala Phe
-15                 -10                 -5                  1

tat gtg cct ggg gtc gcg cct atc aac ttc cac cag aac gat ccc gta    148
Tyr Val Pro Gly Val Ala Pro Ile Asn Phe His Gln Asn Asp Pro Val
            5                   10                  15

gaa atc aag gct gtg aag ctc acc agc tct cga acc cag cta cct tat    196
Glu Ile Lys Ala Val Lys Leu Thr Ser Ser Arg Thr Gln Leu Pro Tyr
        20                  25                  30

gaa tac tat tca ctg ccc ttc tgc cag ccc agc aag ata acc tac aag    244
Glu Tyr Tyr Ser Leu Pro Phe Cys Gln Pro Ser Lys Ile Thr Tyr Lys
    35                  40                  45

gca gag aat ctg gga gag gtg ctg aga ggg gac cgg att gtc aac acc    292
Ala Glu Asn Leu Gly Glu Val Leu Arg Gly Asp Arg Ile Val Asn Thr
50                  55                  60                  65

cct ttc cag gtt ctc atg aac agc gag aag aag tgt gaa gtt ctg tgc    340
Pro Phe Gln Val Leu Met Asn Ser Glu Lys Lys Cys Glu Val Leu Cys
                70                  75                  80

agc cag tcc aac aag cca gtg acc ctg aca gtg gag cag agc cga ctc    388
Ser Gln Ser Asn Lys Pro Val Thr Leu Thr Val Glu Gln Ser Arg Leu
            85                  90                  95

gtg gcc gag cgg atc aca gaa gac tac tac gtc cac ctc att gct gac    436
Val Ala Glu Arg Ile Thr Glu Asp Tyr Tyr Val His Leu Ile Ala Asp
        100                 105                 110

aac ctg cct gtg gcc acc ggc tgg agc tct act cca acc gag aca gcg    484
Asn Leu Pro Val Ala Thr Gly Trp Ser Ser Thr Pro Thr Glu Thr Ala
    115                 120                 125

atg aca ag    492
Met Thr
130



<210> 106 
<211> 126 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 41..124

<221> sig_peptide
<222> 41..94
<223> Von Heijne matrix
score 9.10000038146973
seq LISLLQCAHVSLG/LQ

<400> 106

tccatttgca gcccatggtt gtcacatcac gctgtttata atg ccc agc ccc tgc 55
Met Pro Ser Pro Cys
-15

ctg atc tct ctt ctt caa tgt gct cat gtg tcc ctt ggc tta cag tat 103
Leu Ile Ser Leu Leu Gln Cys Ala His Val Ser Leu Gly Leu Gln Tyr
-10 -5 1

cca tgc stt ctc ctt ctc cct cc 126
Pro Cys Xaa Leu Leu Leu Pro
5 10

<210> 107 
<211> 242 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 82..240

<221> sig_peptide
<222> 82..132
<223> Von Heijne matrix
score 9.10000038146973
seq LVLAAFCLGIASA/VP

<400> 107

accagaccgc ggacgtctgt aatctcagag gcttgtttgc tgagggtgcc tgcgcastgc 60
gacggctgct ggttttgaaa c atg aat ctt tcg ctc gtc ctg gct gcc ttt 111
Met Asn Leu Ser Leu Val Leu Ala Ala Phe
-15 -10

tgc ttg gga ata gcc tcc gct gtt cca aaa ttt gac caa aat ttg gat 159
Cys Leu Gly Ile Ala Ser Ala Val Pro Lys Phe Asp Gln Asn Leu Asp
-5 1 5

aca aag tgg tac cag tgg aag gca aca cac aga aga tta tat ggc gcg 207
Thr Lys Trp Tyr Gln Trp Lys Ala Thr His Arg Arg Leu Tyr Gly Ala
10 15 20 25

aat gaa gaa gga tgg agg aga gca gcg tgg gag gg 242
Asn Glu Glu Gly Trp Arg Arg Ala Ala Trp Glu
30 35

<210> 108 
<211> 336 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 81..335

<221> sig_peptide
<222> 81..137
<223> Von Heijne matrix
score 9
seq WVFLVAIFTGVHC/EV

<400> 108

agctctggga gaggagcccc agccgtgaga ttcccagaag tttccacttg gtgatcagca 60
ctgaacacag accaccaacc atg gag ttt ggc ctt aat tgg gtt ttc ctt gtt 113
Met Glu Phe Gly Leu Asn Trp Val Phe Leu Val
-15 -10

gct att ttt aca ggt gtc cac tgt gag gtg cag ttg gtg gag tct ggg 161
Ala Ile Phe Thr Gly Val His Cys Glu Val Gln Leu Val Glu Ser Gly
-5 1 5

gga gac ctg gta cag cca ggg cgg tcc ctg aga ctc tcc tgt aca gct 209
Gly Asp Leu Val Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Thr Ala
10 15 20

tct gga ttc acc ttt ggt gat tat gcc atg acc tgg ttc cgc cag gct 257
Ser Gly Phe Thr Phe Gly Asp Tyr Ala Met Thr Trp Phe Arg Gln Ala
25 30 35 40

tca ggg aag cga ctg gag tgg cta ggt ttc att aga aat aga ggt tcs 305
Ser Gly Lys Arg Leu Glu Trp Leu Gly Phe Ile Arg Asn Arg Gly Ser
45 50 55

ggt ggg tca gca gag tac ggc gcg tct gtg a 336
Gly Gly Ser Ala Glu Tyr Gly Ala Ser Val
60 65



<210> 109 
<211> 160 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 6..158

<221> sig_peptide
<222> 6..56
<223> Von Heijne matrix
score 9
seq LLILLMLLLFAIH/IN

<400> 109

cagct atg aaa aac tgc cta ctc ata ctc ctc atg ctt ctc tta ttt gca 50
Met Lys Asn Cys Leu Leu Ile Leu Leu Met Leu Leu Leu Phe Ala
-15 -10 -5

ata cac ata aac cgt atg aat gta agg aat gtg gga aat act tta gtc 98
Ile His Ile Asn Arg Met Asn Val Arg Asn Val Gly Asn Thr Leu Val
1 5 10

gta gtg caa atc tta ttc agc atc aga gta ttc ata ctg gag aga aac 146
Val Val Gln Ile Leu Phe Ser Ile Arg Val Phe Ile Leu Glu Arg Asn
15 20 25 30

cct ttg aat gtg gg 160
Pro Leu Asn Val



<210> 110 
<211> 527 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 81..527

<221> sig_peptide
<222> 81..137
<223> Von Heijne matrix
score 9
seq WIPLLAILKGVQC/EV

<221> misc_feature
<222> 307..308,466..467
<223> n=a, g, c or t

<400> 110

agctctggga gaggagcccc agccctgaga ttcccaggtg tttccattca gtgatcagca 60
ctgaacacag aggactcacc atg gag ttg gga ctg agc tgg att ttc ctt ttg 113
Met Glu Leu Gly Leu Ser Trp Ile Phe Leu Leu
-15 -10

gct att tta aaa ggt gtc cag tgt gaa gtg cag ctg gtg gag tct ggg 161
Ala Ile Leu Lys Gly Val Gln Cys Glu Val Gln Leu Val Glu Ser Gly
-5 1 5

gga ggc ttg gta caa cct ggc agg tcc ctg aga ctc tcc tgt gca gcc 209
Gly Gly Leu Val Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Ala Ala
10 15 20

tct gga ttc acc ttt gat gat tac gcc atg cac tgg gtc cgg caa gct 257
Ser Gly Phe Thr Phe Asp Asp Tyr Ala Met His Trp Val Arg Gln Ala
25 30 35 40

cca ggg aag ggc ctg gag tgg gtc tca gga att act tgg aat agt ggt 305
Pro Gly Lys Gly Leu Glu Trp Val Ser Gly Ile Thr Trp Asn Ser Gly
45 50 55

ann ata ggc tac gcg gac tct gtg aag ggc cga ttc acc atc tcc aga 353
Xaa Ile Gly Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg
60 65 70

gac aac gcc aag aac tcc ctg tat ttg caa atg aac agt ctg aga act 401
Asp Asn Ala Lys Asn Ser Leu Tyr Leu Gln Met Asn Ser Leu Arg Thr
75 80 85

gag gac acg gcc ttc tat ttc tgt gca aaa gct cgc ggg ctc ttt agc 449
Glu Asp Thr Ala Phe Tyr Phe Cys Ala Lys Ala Arg Gly Leu Phe Ser
90 95 100

gat acc tgg ccc tac vnn cac tac gct atg gac gtc tgg ggc caa ggg 497
Asp Thr Trp Pro Tyr Xaa His Tyr Ala Met Asp Val Trp Gly Gln Gly
105 110 115 120

acc acg gtc acc gtc tcc tca gcc tcc acc 527
Thr Thr Val Thr Val Ser Ser Ala Ser Thr
125 130



<210> 111 
<211> 154 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 80..154

<221> sig_peptide
<222> 80..121
<223> Von Heijne matrix
score 8.89999961853027
seq LLVFPVLWTCSLA/LL

<400> 111

ctggaaaggg aggagccaaa aggggaacgc tttcttgatt gtcccagcct cattaggagc 60
taccacaggg ctctcctgc atg ctc ctt gtt ttc ttt gtg ctc tgg act tgc 112
Met Leu Leu Val Phe Phe Val Leu Trp Thr Cys
-10 -5

tca ctt gca ctg ctt gct tct tcc cca atc gcm gcc yac cca 154
Ser Leu Ala Leu Leu Ala Ser Ser Pro Ile Ala Ala Xaa Pro
1 5 10
<210> 112
<211> 441
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 59..439

<221> sig_peptide
<222> 59..115
<223> Von Heijne matrix
score 8.89999961853027
seq ILLLVAAATDASS/QM

<400> 112

atcacccatc aaccacatcc ctcctctaga gagtcccctg aaagcacagc tcctcacc 58

atg gac tgg acc tgg aga atc ctc ctc ttg gtg gca gca gcc aca gat 106
Met Asp Trp Thr Trp Arg Ile Leu Leu Leu Val Ala Ala Ala Thr Asp
-15 -10 -5

gcc tcc tcc cag atg cag ctg ttg cag tct ggg cct gaa gtg aag aag 154
Ala Ser Ser Gln Met Gln Leu Leu Gln Ser Gly Pro Glu Val Lys Lys
1 5 10

act ggg tcc tca gtg aaa ctt tcc tgc acg gcc tcc ggc gac acc ctc 202
Thr Gly Ser Ser Val Lys Leu Ser Cys Thr Ala Ser Gly Asp Thr Leu
15 20 25

gcc tac cac tac ctg cac tgg gtg cga cag gcc ccc gga caa gcg ctt 250
Ala Tyr His Tyr Leu His Trp Val Arg Gln Ala Pro Gly Gln Ala Leu
30 35 40 45

gag tgg atg gga tgg atc aca cct ttc agt gga gac acc aac ttc gca 298
Glu Trp Met Gly Trp Ile Thr Pro Phe Ser Gly Asp Thr Asn Phe Ala
50 55 60

cag cga ttc cag gac aga ctc acc ttc acc agg gac agg tct atg agc 346
Gln Arg Phe Gln Asp Arg Leu Thr Phe Thr Arg Asp Arg Ser Met Ser
65 70 75

aca gtc tac atg acc ctg acc agc ctg ata tct gaa gac aca gcc atg 394
Thr Val Tyr Met Thr Leu Thr Ser Leu Ile Ser Glu Asp Thr Ala Met
80 85 90

tat tac tgt gcc act gat gga cgt cgc acc aac cgt ctt ttt gaa ca 441
Tyr Tyr Cys Ala Thr Asp Gly Arg Arg Thr Asn Arg Leu Phe Glu
95 100 105



<210> 113 
<211> 369 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 164..367

<221> sig_peptide
<222> 164..217
<223> Von Heijne matrix
score 8.80000019073486
seq LGCLLWLLTHIKA/QD

<221> misc_feature
<222> 290..292
<223> n=a, g, c or t

<400> 113

cagtttcagt ttctctccct tcctagtaga gacaaaaagg agacacattt tatccgtgca 60
tccaaagact ccgatgttgg tcatggactt gggaagacag tcttcccttg gcgtttgatc 120
actgcggaga tgccttcctt gatcattcac ccacattccc ttg atg gca ggt caa 175
Met Ala Gly Gln
-15

ttg ctg gga tgc ctg ctt tgg ctg ctc acc cac att aaa gcc cag gac 223
Leu Leu Gly Cys Leu Leu Trp Leu Leu Thr His Ile Lys Ala Gln Asp
-10 -5 1

tca gtc agg gat gcc tac tgg aag act ggt agc tgc cca cct cca ttt 271
Ser Val Arg Asp Ala Tyr Trp Lys Thr Gly Ser Cys Pro Pro Pro Phe
5 10 15

ctc cat gtg tct acc ttc nnn kkt aaa ctt acc ttc tcc act aag ggc 319
Leu His Val Ser Thr Phe Xaa Xaa Lys Leu Thr Phe Ser Thr Lys Gly
20 25 30

aac ctt ctg cat tcc att cct ctc tct tcc ccc tta gcc tgt gtt ctt 367
Asn Leu Leu His Ser Ile Pro Leu Ser Ser Pro Leu Ala Cys Val Leu
35 40 45 50

ag 369



<210> 114 
<211> 334 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 20..334

<221> sig_peptide
<222> 20..292
<223> Von Heijne matrix
score 8.80000019073486
seq LFLMLLGELGVFA/SY

<221> misc_feature
<222> 295
<223> n=a, g, c or t

<400> 114

agctctgaat tgggaaggg atg aag gag gct gtg cct ccg ggt tgc acg aag 52
Met Lys Glu Ala Val Pro Pro Gly Cys Thr Lys
-90 -85

agt ccg agt cat ttc tca gaa ggt ttt gat agg tgg gcc tta gag gag 100
Ser Pro Ser His Phe Ser Glu Gly Phe Asp Arg Trp Ala Leu Glu Glu
-80 -75 -70 -65

acg ccg ccg gaa aac ctg att ggc gcc ctc ttg gcg atc ttc ggg cac 148
Thr Pro Pro Glu Asn Leu Ile Gly Ala Leu Leu Ala Ile Phe Gly His
-60 -55 -50

ctc gtg gtc agc att gca ctt aac ctc cag aag tac tgc cac atc cgc 196
Leu Val Val Ser Ile Ala Leu Asn Leu Gln Lys Tyr Cys His Ile Arg
-45 -40 -35

ctg gca ggc tcc aag gat ccc cgg gcc tat ttc aag acc aag aca tgg 244
Leu Ala Gly Ser Lys Asp Pro Arg Ala Tyr Phe Lys Thr Lys Thr Trp
-30 -25 -20

tgg ctg ggc ctg ttc ctg atg ctt ctg ggc gag ctg ggt gtg ttc gcm 292
Trp Leu Gly Leu Phe Leu Met Leu Leu Gly Glu Leu Gly Val Phe Ala
-15 -10 -5

tcn tac gcc ttc gcg ccg ctg tca ctc atc gtg ccc ctc agc 334
Ser Tyr Ala Phe Ala Pro Leu Ser Leu Ile Val Pro Leu Ser
1 5 10



<210> 115 
<211> 153
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 21..152

<221> sig_peptide
<222> 21..74
<223> Von Heijne matrix
score 8.80000019073486
seq LLSCWALLGTTFG/CG

<400> 115

acacccctgc cagcggcacc atg gct ttc ctc tgg ctc ctc tcc tgc tgg gcc 53
Met Ala Phe Leu Trp Leu Leu Ser Cys Trp Ala
-15 -10

ctc ctg ggt acc acc ttc ggc tgc ggg gtc ccc gcc atc cac cct ggc 101
Leu Leu Gly Thr Thr Phe Gly Cys Gly Val Pro Ala Ile His Pro Gly
-5 1 5

tgc caa ctg agc ccg cgg ctc cct ccg acc ctg ctc ccc aca gag cgc 149
Cys Gln Leu Ser Pro Arg Leu Pro Pro Thr Leu Leu Pro Thr Glu Arg
10 15 20 25

ggg g 153
Gly

<210> 116 
<211> 292 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 47..292

<221> sig_peptide
<222> 47..106
<223> Von Heijne matrix
score 8.80000019073486
seq LWLFFVLNLGSFA/FS

<400> 116

taccagtaac ttctttcatg gttcaataaa atcatagctt tagttt atg gca cct 55
Met Ala Pro
-20

ttt caa aac ttc ctt tgg ctt ttc ttt gtg ctt aat tta ggw agt ttt 103
Phe Gln Asn Phe Leu Trp Leu Phe Phe Val Leu Asn Leu Gly Ser Phe
-15 -10 -5

gct ttt agc tca mtt ccd aat tct ctt ttt tac aca att cat ttt ggt 151
Ala Phe Ser Ser Xaa Pro Asn Ser Leu Phe Tyr Thr Ile His Phe Gly
1 5 10 15

cct aat ttc ttt act tta cta tat aaa caa ggt gct gaa atg tgt gtg 199
Pro Asn Phe Phe Thr Leu Leu Tyr Lys Gln Gly Ala Glu Met Cys Val
20 25 30

tat gta ttt aac ttc ctc tac cca ttt gct ctt ggt tat ttc ttc agt 247
Tyr Val Phe Asn Phe Leu Tyr Pro Phe Ala Leu Gly Tyr Phe Phe Ser
35 40 45

tat gat att ctg gat ttg cca gtc akt gtc cgt cct cct agc ggg 292
Tyr Asp Ile Leu Asp Leu Pro Val Xaa Val Arg Pro Pro Ser Gly
50 55 60



<210> 117 
<211> 304
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 141..302

<221> sig_peptide
<222> 141..245
<223> Von Heijne matrix
score 8.69999980926514
seq LLLSVAFNQLVFA/LY

<400> 117

tttctcatca atttcttgct tctctggcaa cctcaacctc tgattcctga ggccaataaa 60
actgaaactt tctgcttgag ctcttgtttt gccaggctga tggggctgag gtgcaccctc 120
tgaggaaaag ctgtaaatac atg gat ttt acc caa tgc cat tcc ctt ctt tta 173
Met Asp Phe Thr Gln Cys His Ser Leu Leu Leu
-35 -30 -25

agg gtt gaa tat tct cca gtg tct gtc tgc ttt tta tta ctt tcc gtt 221
Arg Val Glu Tyr Ser Pro Val Ser Val Cys Phe Leu Leu Leu Ser Val
-20 -15 -10

gcc ttc aat cag ttg gtt ttt gct ttg tat cca ata caa gct acw btc 269
Ala Phe Asn Gln Leu Val Phe Ala Leu Tyr Pro Ile Gln Ala Thr Xaa
-5 1 5

tgt ttc tct dda gtt tct ctc cct ttc ccc gct ca 304
Cys Phe Ser Xaa Val Ser Leu Pro Phe Pro Ala
10 15



<210> 118 
<211> 145 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 76..144

<221> sig_peptide
<222> 76..120
<223> Von Heijne matrix
score 8.5
seq LLLLACGVPSLWP/FA

<400> 118

gtgaaggtag gagtgttggg gccctgaccc ccgcaggagg gatgggcgga ttcgaggact 60
ggctgcctgc ccatc atg ctc ttg ctc ctg ctg gcc tgt ggt gtt ccc agc 111
Met Leu Leu Leu Leu Leu Ala Cys Gly Val Pro Ser
-15 -10 -5

ctg tgg ccc ttt gcw ctt gct ctc tta aag acc c 145
Leu Trp Pro Phe Ala Leu Ala Leu Leu Lys Thr
1 5

<210> 119 
<211> 288 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 160..288

<221> sig_peptide
<222> 160..228
<223> Von Heijne matrix
score 8.5
seq SFLLLHLCQVLLS/RR

<400> 119

tctttagtcc tgtttatatg atgaatcaca tttattgatt tgcatatgtt gaaccatcct 60
tgtatcccag ggataaagcc tacttgattg taatggataa gcttcatgat gtgctgctga 120
atttggtttg ccagtatttt gttaaggatt tttacatca atg ttc att gag aat 174
Met Phe Ile Glu Asn
-20

att ggv ctg aag ttt tct ttt ttg ttg ttg cat ctc tgc cag gtt ttg 222
Ile Gly Leu Lys Phe Ser Phe Leu Leu Leu His Leu Cys Gln Val Leu
-15 -10 -5

cta tca aga cga gct ggt acc att cct act gaa aca att cca aaa aaa 270
Leu Ser Arg Arg Ala Gly Thr Ile Pro Thr Glu Thr Ile Pro Lys Lys
1 5 10

ttg agg agg aga gac ggg 288
Leu Arg Arg Arg Asp Gly
15 20

<210> 120 
<211> 386 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 71..385

<221> sig_peptide
<222> 71..142
<223> Von Heijne matrix
score 8.5
seq XALLMGFLMVCLG/AF

<400> 120

aattctcttg gagcaagcag ggaagcagag gagcagcagg gtcagggtgc tgggttccta 60
aggtgcaagg atg cag aac aga act ggc ctc att ctc tgt gct ytt gcc 109
Met Gln Asn Arg Thr Gly Leu Ile Leu Cys Ala Xaa Ala
-20 -15

ctc ctg atg ggt ttc ctg atg gtc tgc ctg ggg gcc ttc ttc att tcc 157
Leu Leu Met Gly Phe Leu Met Val Cys Leu Gly Ala Phe Phe Ile Ser
-10 -5 1 5

tgg ggc tcc ata ttc gac tgt cag ggg agc ctg att gcg gcc tat ttg 205
Trp Gly Ser Ile Phe Asp Cys Gln Gly Ser Leu Ile Ala Ala Tyr Leu
10 15 20

ctt ctg cct ctg ggg ttt gtg atc ctt ctg agt gga att ttc tgg agc 253
Leu Leu Pro Leu Gly Phe Val Ile Leu Leu Ser Gly Ile Phe Trp Ser
25 30 35

aac tat cgc cag gtg act gaa agc aaa gga gtg ttg agg cac atg ctc 301
Asn Tyr Arg Gln Val Thr Glu Ser Lys Gly Val Leu Arg His Met Leu
40 45 50

cga caa cac ctt gct cat ggg gcc ctg ccc gtg gcc aca gta gac agt 349
Arg Gln His Leu Ala His Gly Ala Leu Pro Val Ala Thr Val Asp Ser
55 60 65

gct gct ctt ctg aaa atc atg tgt aag car ttg ctt t 386
Ala Ala Leu Leu Lys Ile Met Cys Lys Gln Leu Leu
70 75 80



<210> 121 
<211> 190 



<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 34..189

<221> sig_peptide
<222> 34..165
<223> Von Heijne matrix
score 8.5
seq LTCTSSLLSPALG/RS

<400> 121

atcttgaaaa cggaaaataa aaacagcaga cct atg aag gtc gaa ggg gaa gaa 54
Met Lys Val Glu Gly Glu Glu
-40

aag ctg tat cga ttg ttg aga tct ggc gac ttg ttt aaa ttt cat cag 102
Lys Leu Tyr Arg Leu Leu Arg Ser Gly Asp Leu Phe Lys Phe His Gln
-35 -30 -25

cct cac ttc tat gaa ctc tca ggc ctc acg tgt acc agc tct ctg ctc 150
Pro His Phe Tyr Glu Leu Ser Gly Leu Thr Cys Thr Ser Ser Leu Leu
-20 -15 -10

tcc ttt gcc ttg gga cgt tcc atc cct gga agt ttc cca g 190
Ser Phe Ala Leu Gly Arg Ser Ile Pro Gly Ser Phe Pro
-5 1 5

















<210> 122 
<211> 211 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 32..211

<221> sig_peptide
<222> 32..88
<223> Von Heijne matrix
score 8.5
seq LLLFSGAVALIQT/WA

<400> 122

agattctccc cagacgccaa ggttgcgggt c atg gag tcc cga acc ctc ctc 52
Met Glu Ser Arg Thr Leu Leu
-15

ctg ctg ttc tcg gga gcc gtg gcc ctg atc cag acc tgg gca ggt gag 100
Leu Leu Phe Ser Gly Ala Val Ala Leu Ile Gln Thr Trp Ala Gly Glu
-10 -5 1

tgc ggg gtc ggg agg gaa aag gcc tct gcg gga agg agc gag ggg ccc 148
Cys Gly Val Gly Arg Glu Lys Ala Ser Ala Gly Arg Ser Glu Gly Pro
5 10 15 20

gcc cgg agg agt aaa tct gca cat ata kbt aat tac aga tta caa tta 196
Ala Arg Arg Ser Lys Ser Ala His Ile Xaa Asn Tyr Arg Leu Gln Leu
25 30 35

caa tca agg cag ggg 211
Gln Ser Arg Gln Gly
40

















<210> 123 
<211> 353 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 249..353

<221> sig_peptide
<222> 249..296
<223> Von Heijne matrix
score 8.39999961853027
seq SVPLLCFWSLCYC/FA

<221> misc_feature
<222> 187
<223> n=a, g, c or t

<400> 123

agcgagtcct tgcctcccgg cggctcagga cgagggcaga tctcgttctg gggcaagccg 60
ttgacactcg ctccctgcca ccgcccgggc tccgtgccgc caagttttca ttttccacct 120
tctctgcctc cagtccccca gcccctggcc gagagaaggg tcttaccggc cgggattgct 180
ggaaacncaa gaggtggttt ttgtttttta aaacttctgt ttcttgggag ggggtgtggc 240
ggggcagg atg agc aac tcc gtt cct ctg ctc tgt ttc tgg agc ctc tgc 290
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys
-15 -10 -5

tat tgc ttt gct gcg ggg agc ccc gta cct ttt ggt cca gag gga cgg 338
Tyr Cys Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg
1 5 10

ctg gaa gat aag ctc 353
Leu Glu Asp Lys Leu
15

<210> 124 
<211> 249 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 93..248

<221> sig_peptide
<222> 93..134
<223> Von Heijne matrix
score 8.39999961853027
seq PWTILLFAAGSLA/IP

<400> 124

tttttccccg atctggcctc acaggaggag ttggcgggga gccttgggcc cctctggcct 60
cagccggatt tcccagccaa acgcagagag ag atg ccc tgg acc atc ttg ctc 113
Met Pro Trp Thr Ile Leu Leu
-10

ttt gca gct ggc tcc ttg gcg atc cca gca cca tcc atc cgg gtg gtg 161
Phe Ala Ala Gly Ser Leu Ala Ile Pro Ala Pro Ser Ile Arg Val Val
-5 1 5

ccc ccg tac cca agc agc caa gag gac ccc atc cac atc gca tgc atg 209
Pro Pro Tyr Pro Ser Ser Gln Glu Asp Pro Ile His Ile Ala Cys Met
10 15 20 25

gcc gct ggg aac ttc ccg ggg gcg aat ttc aca ctg tat c 249
Ala Ala Gly Asn Phe Pro Gly Ala Asn Phe Thr Leu Tyr
30 35



<210> 125 
<211> 375 
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 175..375

<221> sig_peptide
<222> 175..366
<223> Von Heijne matrix
score 8.39999961853027
seq GFLFFGFLFPVFS/FP

<400> 125

gtgctgcggc attcacgtga tctgcacggg cgcagatgta ggcaccggtc cgagtgcctg 60
ccctctgtcc ccgcggctgg gtctcgtctg ctccggttcc tgggctccta attcttggtc 120
cagcttcttc caggtcagtg tgcgggcctt ccacgctgcc agcggaacac tgga atg 177
Met

gcg gaa ggg gaa cgg gtc tgc gcg tct gtk gtt ccc agc gct ctg cga 225
Ala Glu Gly Glu Arg Val Cys Ala Ser Val Val Pro Ser Ala Leu Arg
-60 -55 -50

acg ctg aaa agg agg agc aac ctg tcc aga atc ccc gca gga cag gaa 273
Thr Leu Lys Arg Arg Ser Asn Leu Ser Arg Ile Pro Ala Gly Gln Glu
-45 -40 -35

aag gag ggg aaa tct cga cat gtt gct ccc cct ttt cgc ttt ttc cct 321
Lys Glu Gly Lys Ser Arg His Val Ala Pro Pro Phe Arg Phe Phe Pro
-30 -25 -20

ttt tcc ggt ttt ttg ttt ttt ggt ttt ctt ttt ccc gtc ttt tct ttc 369
Phe Ser Gly Phe Leu Phe Phe Gly Phe Leu Phe Pro Val Phe Ser Phe
-15 -10 -5 1

ccc tcc 375
Pro Ser

<210> 126 
<211> 437 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 223..435

<221> sig_peptide
<222> 223..261
<223> Von Heijne matrix
score 8.39999961853027
seq MFCLAAILASASA/QR

<221> misc_feature
<222> 404
<223> n=a, g, c or t

<400> 126

tcaataccca tgtgaacagt ttcgtggagg gttttaagta ttttccactg gctggctttg 60
ggtataagta cctttccttc ttctgtcgtt aaccacgccg aggggagaaa actatgcccc 120
cgtgaaagtc cccactctgt ttcggttggg gaatactgga gcttaacctc ttggaggggg 180
ttgttccata ccaagggtcc ttccgtaggt atttctaatg gg atg ttc tgc ctg 234
Met Phe Cys Leu
-10

gca gca att tta gcc tca gca tct gcc caa cgg ttt cct tct gcc ttt 282
Ala Ala Ile Leu Ala Ser Ala Ser Ala Gln Arg Phe Pro Ser Ala Phe
-5 1 5

tct cct tca cct tty yga tgg ctt yrg car tgt aas act gcc acc tcc 330
Ser Pro Ser Pro Phe Xaa Trp Leu Xaa Gln Cys Xaa Thr Ala Thr Ser
10 15 20

ttg ggt ttt trc act gtg tgy art aac tcc ata att tcc ttg tgg tat 378
Leu Gly Phe Xaa Thr Val Cys Xaa Asn Ser Ile Ile Ser Leu Trp Tyr
25 30 35

tta ayg ggr gtt ccc cca gag gtt ang gaa ctc cct ttc ttt cca tat 426
Leu Xaa Gly Val Pro Pro Glu Val Xaa Glu Leu Pro Phe Phe Pro Tyr
40 45 50 55

tgc agc atg gg 437
Cys Ser Met

<210> 127
<211> 304
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 24..302



<221> sig_peptide
<222> 24..74
<223> Von Heijne matrix
score 8.39999961853027
seq TLLLLLSEALALT/QT

<400> 127

ctcaggactc agaggctggg atc atg gta gat gga acc ctc ctt tta ctc ctc 53
Met Val Asp Gly Thr Leu Leu Leu Leu Leu
-15 -10

tcg gaa gcc ctg gcc ctt acc car acc tgg gcg ggc tcc cac tcc tkr 101
Ser Glu Ala Leu Ala Leu Thr Gln Thr Trp Ala Gly Ser His Ser Xaa
-5 1 5

aag tat ttc cac act tcc gtg tcc cgg mcc ggc cgc ggg gag ccc cgc 149
Lys Tyr Phe His Thr Ser Val Ser Arg Xaa Gly Arg Gly Glu Pro Arg
10 15 20 25

ttc atc tct gtg ggc tac gtg gac gac acc cgg tca gag tat tgg gac 197
Phe Ile Ser Val Gly Tyr Val Asp Asp Thr Arg Ser Glu Tyr Trp Asp
30 35 40

cgg gag aca cgg agc gcc agg gac acc gca cag att ttc cga gtg aac 245
Arg Glu Thr Arg Ser Ala Arg Asp Thr Ala Gln Ile Phe Arg Val Asn
45 50 55

ctg cgg acg ctg cgc ggc tac tac aat cag agc gag gcc ggg tct cam 293
Leu Arg Thr Leu Arg Gly Tyr Tyr Asn Gln Ser Glu Ala Gly Ser Xaa
60 65 70

acc ctg cag tg 304
Thr Leu Gln
75



<210> 128 
<211> 244 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 19..243

<221> sig_peptide
<222> 19..99
<223> Von Heijne matrix
score 8.39999961853027
seq LVLSLISLSIAWS/MV

<221> misc_feature
<222> 112
<223> n=a, g, c or t

<400> 128

gcgattaggt tttaatgt atg aat ttc agg ggg cca caa acg ttc agt ctt 51
Met Asn Phe Arg Gly Pro Gln Thr Phe Ser Leu
-25 -20

tca cac agc ctt gtg tta tcc cta atc agt ctc tcc att gca tgg tct 99
Ser His Ser Leu Val Leu Ser Leu Ile Ser Leu Ser Ile Ala Trp Ser
-15 -10 -5

atg gtc gaa atg nbc act tct gca agc tac aar caa aag ttt gcc ctt 147
Met Val Glu Met Xaa Thr Ser Ala Ser Tyr Lys Gln Lys Phe Ala Leu
1 5 10 15

aga atc cta gtt gtg cag ttg ccc aca tgg gtg gaa tgt cca gta aac 195
Arg Ile Leu Val Val Gln Leu Pro Thr Trp Val Glu Cys Pro Val Asn
20 25 30

cac agg tgt gca cta ggg aga aag aat tgt tct att agg acc cag cca c 244
His Arg Cys Ala Leu Gly Arg Lys Asn Cys Ser Ile Arg Thr Gln Pro
35 40 45



<210> 129 

<211> 232 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 156..230

<221> sig_peptide
<222> 156..215
<223> Von Heijne matrix
score 8.39999961853027
seq SCICLFLPSLIHS/PP

<400> 129

ctacaggaag gaaaagtgtg acagctttga aaaagaaaga gggtaaaata ttttaaccac 60
ccttggtgtc atttgtggca gcctatagca ttagagcctt tgagaacaga tctttccaga 120
ttctgcttaa gtccagggat tctgtgaccg cagaa atg act ggc atc tcc atc 173
Met Thr Gly Ile Ser Ile
-20 -15

tgc tcg tgc atc tgt ttg ttt ctt cct tca ttg att cac tca ttc ccc 221
Cys Ser Cys Ile Cys Leu Phe Leu Pro Ser Leu Ile His Ser Phe Pro
-10 -5 1

ccg ccc tgc gg 232
Pro Pro Cys
5



<210> 130 
<211> 312 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 17..310

<221> sig_peptide
<222> 17..94
<223> Von Heijne matrix
score 8.30000019073486
seq FLLLVAAPRWVQL/QE

<400> 130

atgctttctg agagtc atg gac ctc ctg tgc aag aac atg aag cac ctg tgg 52
Met Asp Leu Leu Cys Lys Asn Met Lys His Leu Trp
-25 -20 -15

ttc ttc ctc ctg ctg gtg gcg gct ccc aga tgg gtc cag ctg cag gag 100
Phe Phe Leu Leu Leu Val Ala Ala Pro Arg Trp Val Gln Leu Gln Glu
-10 -5 1

tcg ggc cca cgc ctg gtg agg cct ccg gag acc ctg aag cct tcg gag 148
Ser Gly Pro Arg Leu Val Arg Pro Pro Glu Thr Leu Lys Pro Ser Glu
5 10 15

acc ctg tcc ctc acc tgc act att tct ggt gac tcc atg agc agt gct 196
Thr Leu Ser Leu Thr Cys Thr Ile Ser Gly Asp Ser Met Ser Ser Ala
20 25 30

tct tac tat tgg gcc tgg atc cgc cag ccc cca ggc aag ggc ctg gaa 244
Ser Tyr Tyr Trp Ala Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu
35 40 45 50

ttc att ggg cgt gcc tta tat agt ggg acc acc gac tac aat ccg tcc 292
Phe Ile Gly Arg Ala Leu Tyr Ser Gly Thr Thr Asp Tyr Asn Pro Ser
55 60 65

ctc agc agt cga atc acc ct 312
Leu Ser Ser Arg Ile Thr
70

















<210> 131 
<211> 276 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 119..274

<221> sig_peptide
<222> 119..253
<223> Von Heijne matrix
score 8.19999980926514
seq PLSLSLCLSLCHT/HT

<400> 131

gccttcatct ctccatctct gcgctgctgc cggctgcgcc atccagcacc cagactccag 60
caccggccga ggacccccac tccggctgca gggaccctgt cccagcgaga ccgcaggc 118

atg tca tcc gaa aag tca gga ctc cca gac tca gtc cct cac act tct 166
Met Ser Ser Glu Lys Ser Gly Leu Pro Asp Ser Val Pro His Thr Ser
-45 -40 -35 -30

ccg ccg ccc tac aat gcc cct cag cct cca gcc gaa ccc cca gcc ccg 214
Pro Pro Pro Tyr Asn Ala Pro Gln Pro Pro Ala Glu Pro Pro Ala Pro
-25 -20 -15

cct ctc tct ctc tct ctc tgt ctc tct ctc tgt cac aca cac aca cac 262
Pro Leu Ser Leu Ser Leu Cys Leu Ser Leu Cys His Thr His Thr His
-10 -5 1

aca cac aca cac ac 276
Thr His Thr His
5



<210> 132 
<211> 174 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 35..172

<221> sig_peptide
<222> 35..118
<223> Von Heijne matrix
score 8.19999980926514
seq LVSLLMQPEGALG/EE

<400> 132

actctgctga gctcctctgc acctgcccag gacc atg acg cct gct ctg cgc tgc 55
Met Thr Pro Ala Leu Arg Cys
-25

gca ttc gct ctg gcc ata gcg ggc ctc gtg tcg ctg ctg atg cag ccc 103
Ala Phe Ala Leu Ala Ile Ala Gly Leu Val Ser Leu Leu Met Gln Pro
-20 -15 -10

gag ggc gcc ctc ggc gag gag gct gca agt gcc gca gcc cag ggc cgc 151
Glu Gly Ala Leu Gly Glu Glu Ala Ala Ser Ala Ala Ala Gln Gly Arg
-5 1 5 10

cag ttg gct gaa ctt agg ctc ca 174
Gln Leu Ala Glu Leu Arg Leu
15



<210> 133 

<211> 344 

<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 133..342

<221> sig_peptide
<222> 133..246
<223> Von Heijne matrix
score 8.19999980926514
seq LLLIFLSFPYTLC/IL

<400> 133

gcctttcact tgcacaaaca ctgttattat gatcacttat ccaactgaca tttttcagac 60
cttttaactt caactgttct tttttcctgt aaatcttaat tttctttttt tttctcccaa 120
tttttctcct ac atg tct gga ctc ttc cca gtt cct gtc aga gta aat gtt 171
Met Ser Gly Leu Phe Pro Val Pro Val Arg Val Asn Val
-35 -30

gat att gcc cag aac ata act tgc tct tcc ttt tct ctc ctt ctc att 219
Asp Ile Ala Gln Asn Ile Thr Cys Ser Ser Phe Ser Leu Leu Leu Ile
-25 -20 -15 -10

ttt ctt tct ttc ccc tac acc ctc tgt ata ctc tat aga gta aaa tca 267
Phe Leu Ser Phe Pro Tyr Thr Leu Cys Ile Leu Tyr Arg Val Lys Ser
-5 1 5

tat aca ccc acg gag tca ata act gcc ttt aat cta aca att ggg wga 315
Tyr Thr Pro Thr Glu Ser Ile Thr Ala Phe Asn Leu Thr Ile Gly Xaa
10 15 20

ttc cca tat ctt taw wtt tcw acc ccg gg 344
Phe Pro Tyr Leu Xaa Xaa Ser Thr Pro
25 30



<210> 134 
<211> 244 
<212> DNA 

<213> Homo sapiens 
<220>
<221> CDS
<222> 128..244

<221> sig_peptide
<222> 128..226
<223> Von Heijne matrix
score 8.19999980926514
seq HALSLCLCTCAFA/FL

<400> 134
aagcaagaga ggggtgttca ggatgataaa gtcctggttg atgaaggcag atgcctgcag 60
ctcttccctg gggcagggct ggcttccata gggtgcttgt ttgggccctt tggaaggggg 120
tgtgcgg atg tgc agg gct gct tgt atc att aga atg gct gtt aga att 169
Met Cys Arg Ala Ala Cys Ile Ile Arg Met Ala Val Arg Ile
-30 -25 -20

tca ttc ttt ctt tct tac cat gct ctg tct ctc tgc ctt tgt aca tgt 217
Ser Phe Phe Leu Ser Tyr His Ala Leu Ser Leu Cys Leu Cys Thr Cys
-15 -10 -5

gcg ttt gca ttt ctc tcc ctc ctc ggg 244
Ala Phe Ala Phe Leu Ser Leu Leu Gly
1 5

<210> 135 
<211> 217 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 40..216

<221> sig_peptide
<222> 40..90
<223> Von Heijne matrix
score 8.19999980926514
seq LLXALGFLXQVNP/XP

<400> 135
attaaaccac caccagstcc ccaagccacc ccttcagcc atg aag ttc ctg ctc 54
Met Lys Phe Leu Leu
-15

ctg gma gcc ctc gga ttc ctg amc cag gtg aat ccc arc cca att sma 102
Leu Xaa Ala Leu Gly Phe Leu Xaa Gln Val Asn Pro Xaa Pro Ile Xaa
-10 -5 1

ggd ggg tca aaa atg tgt gag twa cac ccc agg ata ctg cag gac atg 150
Gly Gly Ser Lys Met Cys Glu Xaa His Pro Arg Ile Leu Gln Asp Met
5 10 15 20

ttg cca ctg ggg gga gac agc att gtt cat gtg caa cgc tks cag aaa 198
Leu Pro Leu Gly Gly Asp Ser Ile Val His Val Gln Arg Xaa Gln Lys
25 30 35

atg ctg cat cag yta ctc c 217
Met Leu His Gln Leu Leu
40

<210> 136 
<211> 428 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 114..428

<221> sig_peptide
<222> 114..239
<223> Von Heijne matrix
score 8.10000038146973
seq LFCFLLLCLSAAS/LL

<400> 136 

aggcgtctgt gtgcgccgcc aagtcggtgg ggcggggacg cgaggtgtgg atggggggtc 60 
gccttgacct ctgcctcagc cagtagcgca gtctcggcct cgccgttacg gag atg 116 

Met 





ccc 


tgg 


gtg 


CQQ 


acq 


atQ 




cag 


aacr 


CtQ 


aao 


eag 


egg 


CtQ 


ecra 


164 


Val 


Pro 


Trp 


Val 


Arg 


Thr 


Met 


Gly 


Gin 


Lys 


Leu 


Lys 


Gin 


Arg 


Leu 


Arg 






-40 










-35 










-30 












ctg gac gtg gga cgc gag atc tgc cgc cag tac ccg ctg ttc tgc ttc 212
Leu Asp Val Gly Arg Glu Ile Cys Arg Gln Tyr Pro Leu Phe Cys Phe
-25 -20 -15 -10

ctg ctg ctc tgt ctc agc gcc gcc tcc ctg ctt ctt aac agg tat att 260
Leu Leu Leu Cys Leu Ser Ala Ala Ser Leu Leu Leu Asn Arg Tyr Ile
-5 1

cat att tta atg atc ttc tgg tca ttt gtt gct gga gtt gtc aca ttt 308
His Ile Leu Met Ile Phe Trp Ser Phe Val Ala Gly Val Val Thr Phe
10 15 20

tac tgc tca cta gga cct gat tct ctc tta cca aat ata ttc ttc aca 356
Tyr Cys Ser Leu Gly Pro Asp Ser Leu Leu Pro Asn Ile Phe Phe Thr
25 30 35

ata aaa tac aaa ccc aag cag tta gga ctt cag gaa tta ttt cct caa 404
Ile Lys Tyr Lys Pro Lys Gln Leu Gly Leu Gln Glu Leu Phe Pro Gln
40 45 50 55

ggt cat agc tgt gct gtt tgt ggt 428
Gly His Ser Cys Ala Val Cys Gly
60

<210> 137 
<211> 434 
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 305..433

<221> sig_peptide
<222> 305..406
<223> Von Heijne matrix
score 8.10000038146973
seq LLCLFXLFFFSFL/KR

<400> 137
tttctgtcca ttctccctca ccaccctgac gcaggctctg ggaatgtgct gaaggtgcag 60
cagctgctcc acatttgtag cgaacacttt gactccaaag agaaggagga agacaaagac 120
aagaaggaaa agaaagacaa ggacaagaag gaagcccctg ctgacatggg agcacatcag 180
ggagtggctg ttctggggat tgcccttatt gctatggggg aggagattgg tgcagagatg 240
gcattacgaa cctttggcca cttggtgagt atagcatgaa gaaaattgga atatactggt 300
tttg atg gcc tgg ggt tcc cca ggg aag att ttt ctg atg ggt ttt ctt 349
Met Ala Trp Gly Ser Pro Gly Lys Ile Phe Leu Met Gly Phe Leu
-30 -25 -20

ggt gga gag ctg gtc ttt ttg ctg tgc ctt ttc ttw ctt ttt ttc ttt 397
Gly Gly Glu Leu Val Phe Leu Leu Cys Leu Phe Xaa Leu Phe Phe Phe
-15 -10 -5

tct ttt ttg aag cgg agt ttt gct cta gag tgc aat g 434
Ser Phe Leu Lys Arg Ser Phe Ala Leu Glu Cys Asn
1 5

<210> 138
<211> 395
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 310..393

<221> sig_peptide
<222> 310..357
<223> Von Heijne matrix
score 8.10000038146973
seq SILLLLAPPLPSA/VS

<221> misc_feature
<222> 189
<223> n=a, g, c or t

<400> 138
aaaagctctg taaacatata ataaatggaa ttccattgac attcaagcct tacgtatttc 60
cagagcttct tcgacttatc ctgcctcccc tactttaatt ctgttaaagt agttgaacac 120
cattcttctc ataatagttc tccctcsatt cttcagtgat tyccttgtgt ttataggata 180
aagtccacnt gttattttgg cagtcagttc aagatccaca aatcagtctt tacccttaca 240
tccttatttc tcactgctgt tctaatatag tctttatacc agtcaggctg gtctgttcac 300
tattcctga atg ttt ttc tcc att ctt ttg tta ttg gca ccc ccg cta ccc 351
Met Phe Phe Ser Ile Leu Leu Leu Leu Ala Pro Pro Leu Pro
-15 -10 -5

tct gca gtg tct ttg cta cct ttc ttt ttc tac tgt gtg cag gg 395
Ser Ala Val Ser Leu Leu Pro Phe Phe Phe Tyr Cys Val Gln
1 5 10

<210> 139 
<211> 268 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 141..266

<221> sig_peptide
<222> 141..206
<223> Von Heijne matrix
score 8.10000038146973
seq LLVCSWLSISLHA/HT

<400> 139
caactctgct gttttgtagg aagccacatg gaggtcattt acggttacta gttatcttag 60
tcagcttggg cagccattaa aaaataatac tgtagacgga gtggcccaaa cgagagaaat 120
ttatttctta tagttttggc atg gta gat ttc atc ctg agg tct ctt ctc ttg 173
Met Val Asp Phe Ile Leu Arg Ser Leu Leu Leu
-20 -15

gtt tgt agt tgg ctg tca atc tcc ctg cat gct cac acg acc gct ttt 221
Val Cys Ser Trp Leu Ser Ile Ser Leu His Ala His Thr Thr Ala Phe
-10 -5 1 5

tgt aca tac agt aag aaa ata cac act gtc atg tca ttt ttt tgt aa 268
Cys Thr Tyr Ser Lys Lys Ile His Thr Val Met Ser Phe Phe Cys
10 15 20



<210> 140
<211> 170
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 93..170

<221> sig_peptide
<222> 93..140
<223> Von Heijne matrix
score 8.10000038146973
seq LLYFLCVSSYVTS/FF

<400> 140
ttttgactga tatcaaattc taggtggacc gagattttct ttcagtcttt caaagatatt 60
actctattgc cttctatctt gcatagtttc tg atg aga agt ctg ttg tat ttc 113
Met Arg Ser Leu Leu Tyr Phe
-15 -10

tta tgt gtt tct tca tat gta aca tct ttt ttc ttt ttt ttt ttt ttt 161
Leu Cys Val Ser Ser Tyr Val Thr Ser Phe Phe Phe Phe Phe Phe Phe
-5 1 5

ttt ttt ttt 170
Phe Phe Phe
10

<210> 141 
<211> 396 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 192..395

<221> sig_peptide
<222> 192..236
<223> Von Heijne matrix
score 8
seq FISFLCLIALAGT/SS

<400> 141
gattctcagc ttagttgctg ttggtgtata ggagagctac tgatttgtgt acattaattt 60
tgtatccgga aactttgttg aattatttta tcagttctag gagctttttg gaggagtctt 120
tagggttctc taggtataca atcatatcat cagcaaacag tgacaattcg acttcctctt 180
tatggatttg t atg ccc ttt att tct ttc ctt tgt ctg att gct ctg gct 230
Met Pro Phe Ile Ser Phe Leu Cys Leu Ile Ala Leu Ala
-15 -10 -5

ggg act tcc agt act atg ttg aga agt gct ctg gct ggg act tcc agt 278
Gly Thr Ser Ser Thr Met Leu Arg Ser Ala Leu Ala Gly Thr Ser Ser
1 5 10

act atg tkg arg aga agt ggt gam agt ggg wat cct kgh ctk gty cma 326
Thr Met Xaa Xaa Arg Ser Gly Xaa Ser Gly Xaa Pro Xaa Leu Val Xaa
15 20 25 30

gtc ctm aga ggg aat gct ttc agc ttt ttc cca ttc agt ctg atg twg 374
Val Leu Arg Gly Asn Ala Phe Ser Phe Phe Pro Phe Ser Leu Met Xaa
35 40 45

gct atg ggt tgt cat aga tgg c 396
Ala Met Gly Cys His Arg Trp
50
<210> 142 
<211> 357 



<212> DNA

<213> Homo sapiens 



<220>
<221> CDS
<222> 292..357

<221> sig_peptide
<222> 292..339
<223> Von Heijne matrix
score 8
seq FLLGAIFIALSSS/RI

<400> 142
cgtgcctgcg caatgggtgt cgggtccgct ttttcccaat ccggacgtaa tcgtggtttt 60
tgttctgcaa taggcggctt agagggaggg gctttttcgc ctatacctac tgtagcttct 120
ccacgtatgg accctaaagg ctactgctgc tactacgggg ctagacagtt actgtctcag 180
ctctaggatg tgcgttcttc cactagaagc tcttctgagg gaggtaatta aaaaacagtg 240
gaatggaaaa acagtgctgt agtcatcctg taatatgctc cttgtcaaca a atg tat 297
Met Tyr
-15

aca ttc ctg cta ggt gcc ata ttc att gct tta agc tca agt cgc atc 345
Thr Phe Leu Leu Gly Ala Ile Phe Ile Ala Leu Ser Ser Ser Arg Ile
-10 -5 1

tta cta gtg aag 357
Leu Leu Val Lys
5



<210> 143 
<211> 159 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 26..157

<221> sig_peptide
<222> 26..151
<223> Von Heijne matrix
score 7.90000009536743
seq LVCVCVCVCVCXC/XR

<400> 143
tgtgtgtgtg tgtgtctgcg tgtgt atg tgt ttg tgt ccc tgc tgg gat gtg 52
Met Cys Leu Cys Pro Cys Trp Asp Val
-40 -35

ttt act gtg ttt gtg tgt gtc tct gtg tgt gtg tct gtg tct gtc cct 100
Phe Thr Val Phe Val Cys Val Ser Val Cys Val Ser Val Ser Val Pro
-30 -25 -20

gtc ggg atg tat tta gtg tgt gtg tgt gtg tgt gtg tgt gtg tgt stc 148
Val Gly Met Tyr Leu Val Cys Val Cys Val Cys Val Cys Val Cys Xaa
-15 -10 -5

tgc gyg cgt gg 159
Cys Xaa Arg
1



<210> 144 
<211> 433 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS
<222> 282..431

<221> sig_peptide
<222> 282..383
<223> Von Heijne matrix
score 7.90000009536743
seq LFSLLMLTQSPLA/GQ

<221> misc_feature
<222> 132,149
<223> n=a, g, c or t

<400> 144
aaaataaggt atctggcaaa agaatatatg aaagagtatg aagaactctc cttgaaagct 60
gtggccccca ttggccatgg ctgcagagcc gatgtcccgg ccaatccagg cgggatcccc 120
ttgaagcmgg knsmwhbtcy kragscwknc cmabtctccg ggggcaastc ttttcccttc 180
cctgtgaccc kcttcggaca gttgaccatc tcaacaccta gtggttaaaa agaagagcat 240
ggacggcctg gggcctgcac tggctgtgct gggagtttgt c atg ttg ata gct aag 296
Met Leu Ile Ala Lys
-30

cag gcc cag ccc caa ggc ctc act gcc atc tgc ttc cct ctc aca cct 344
Gln Ala Gln Pro Gln Gly Leu Thr Ala Ile Cys Phe Pro Leu Thr Pro
-25 -20 -15

ctc ttc tcc ctc ctc atg ctc act cag agc ccc ctt gca ggt cag gaa 392
Leu Phe Ser Leu Leu Met Leu Thr Gln Ser Pro Leu Ala Gly Gln Glu
-10 -5 1

gga aga gaa gga ggg aaa gaa cgg tac ttg ttg gtg att ca 433
Gly Arg Glu Gly Gly Lys Glu Arg Tyr Leu Leu Val Ile
5 10 15



<210> 145 
<211> 200 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 15..200

<221> sig_peptide
<222> 15..92
<223> Von Heijne matrix
score 7.90000009536743
seq RVCLLSLSLFLWA/NR

<400> 145
aatacgccag gaac atg cta agg acc tgg agc tct cta ccc tgg acc cgt 50
Met Leu Arg Thr Trp Ser Ser Leu Pro Trp Thr Arg
-25 -20 -15

ttt cgg gtt tgc ttg ctc tct ctc tct ctc ttt ctc tgg gct aat cgt 98
Phe Arg Val Cys Leu Leu Ser Leu Ser Leu Phe Leu Trp Ala Asn Arg
-10 -5 1

tta gag gac agt cgc tcc tgc caa cct aat ccc atg agc ctg act acc 146
Leu Glu Asp Ser Arg Ser Cys Gln Pro Asn Pro Met Ser Leu Thr Thr
5 10 15

ttg ccg ggc cac agg ctc aaa gaa gca gtg tgg ctg cca gca ccc tca 194
Leu Pro Gly His Arg Leu Lys Glu Ala Val Trp Leu Pro Ala Pro Ser
20 25 30

ctt ggg 200
Leu Gly
35

<210> 146
<211> 297
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 80..295

<221> sig_peptide
<222> 80..166
<223> Von Heijne matrix
score 7.90000009536743
seq LVVXWLLPXQCSC/ER

<400> 146
aacaccccag cccaagttca tccccggtcc cttggcagca gtgcgcatcc acaaagccag 60
cggcacaatt taattactg atg gcc cct ttc cta cga cag gtg gat rtg tgg 112
Met Ala Pro Phe Leu Arg Gln Val Asp Xaa Trp
-25 -20

gga gca cag gcc ggt ctg gtg gtb gsm tgg tta cta cca tgs caa tgc 160
Gly Ala Gln Ala Gly Leu Val Val Xaa Trp Leu Leu Pro Xaa Gln Cys
-15 -10 -5

agc tgt gaa cga tca gag caa tat ctg agc acc tgt ctc cca cag cac 208
Ser Cys Glu Arg Ser Glu Gln Tyr Leu Ser Thr Cys Leu Pro Gln His
1 5 10

tca agc atc aag cag tcg tgc atc aag cat cca gca ggc ccg atc ccc 256
Ser Ser Ile Lys Gln Ser Cys Ile Lys His Pro Ala Gly Pro Ile Pro
15 20 25 30

gca ggc cac cta cag gga aag gcc aca gct gcg ccc ctg gg 297
Ala Gly His Leu Gln Gly Lys Ala Thr Ala Ala Pro Leu
35 40



<210> 147 
<211> 300 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 80..298

<221> sig_peptide
<222> 80..136
<223> Von Heijne matrix
score 7.90000009536743
seq WLFLVAILKGVRC/EV

<400> 147
agctctgaga gaggagccca gccctgggat cttcaggtgt tttcactcgg tgatcaggac 60
tgcacagaga gaactcacc atg gag ttt ggg ctg aag tgg ctt ttt ctt gtg 112
Met Glu Phe Gly Leu Lys Trp Leu Phe Leu Val
-15 -10

gca att tta aaa ggt gtc cgg tgt gaa gtg aag ctg gtg gag tct ggg 160
Ala Ile Leu Lys Gly Val Arg Cys Glu Val Lys Leu Val Glu Ser Gly
-5 1 5

gga ggc ctg gtg cag ccg ggg ggg tcc ctg aga ctc tcc tgt gta gga 208
Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Val Gly
10 15 20

tct gga ttc gtc ttc gat aaa tat ggc ata agt tgg gtg cgc cag gca 256
Ser Gly Phe Val Phe Asp Lys Tyr Gly Ile Ser Trp Val Arg Gln Ala
25 30 35 40

cca gga aag ggc cta cag tgg gtc gcg ggg atc ggt ggc ggg gg 300
Pro Gly Lys Gly Leu Gln Trp Val Ala Gly Ile Gly Gly Gly
45 50

<210> 148 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 21..404

<221> sig_peptide
<222> 21..68
<223> Von Heijne matrix
score 7.90000009536743
seq AMLVLVVSPWSAA/RG

<400> 148
gcggtcttcc agcagggaaa atg gcg ctg gcc atg ctg gtc ttg gtg gtt tcg 53
Met Ala Leu Ala Met Leu Val Leu Val Val Ser
-15 -10

ccg tgg tct gcg gcc cgg gga gtg ctt cga aac tac tgg gag cga ctg 101
Pro Trp Ser Ala Ala Arg Gly Val Leu Arg Asn Tyr Trp Glu Arg Leu
-5 1 5 10

cta cgg aag ctt ccg cag agc cgg ccg ggc ttt ccc agt cct ccg tgg 149
Leu Arg Lys Leu Pro Gln Ser Arg Pro Gly Phe Pro Ser Pro Pro Trp
15 20 25

gga cca gca tta gca gta cag ggc cca gcc atg ttt aca gag cca gca 197
Gly Pro Ala Leu Ala Val Gln Gly Pro Ala Met Phe Thr Glu Pro Ala
30 35 40

aat gat acc agt gga agt aaa gag aat tcc agc ctt ttg gac agt atc 245
Asn Asp Thr Ser Gly Ser Lys Glu Asn Ser Ser Leu Leu Asp Ser Ile
45 50 55

ttt tgg atg gca gct ccc aaa aat aga cgc acc att gaa gtt aac cgg 293
Phe Trp Met Ala Ala Pro Lys Asn Arg Arg Thr Ile Glu Val Asn Arg
60 65 70 75

tgt agg aga aga aat ccg cag aag ctt att aaa gtt aag aac aac ata 341
Cys Arg Arg Arg Asn Pro Gln Lys Leu Ile Lys Val Lys Asn Asn Ile
80 85 90

gac gtt tgt cct gaa tgt ggt cac ctg aaa cag aaa srt gtc ctt tgt 389
Asp Val Cys Pro Glu Cys Gly His Leu Lys Gln Lys Xaa Val Leu Cys
95 100 105

gct act gct atg aaa a 405
Ala Thr Ala Met Lys
110

<210> 149 
<211> 146 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 56..145

<221> sig_peptide
<222> 56..115
<223> Von Heijne matrix
score 7.80000019073486
seq LLLFPLSLLFTLG/FL

<400> 149
aaaccttctg actactaacc tgtagatccc tttagttcct tagcagtatt cacaa atg 58
Met
-20

ttt ttc tac tca cac ttt tta ctt ctt ttt ccc ctc tcg tta ctt ttc 106
Phe Phe Tyr Ser His Phe Leu Leu Leu Phe Pro Leu Ser Leu Leu Phe
-15 -10 -5

aca ctt gga ttt ttg ttt gtc ttt ttt ttt ttt ttt ttt t 146
Thr Leu Gly Phe Leu Phe Val Phe Phe Phe Phe Phe Phe
1 5 10



<210> 150 
<211> 408 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 105..407

<221> sig_peptide
<222> 105..242
<223> Von Heijne matrix
score 7.80000019073486
seq LVLLGTRVPLSGG/GP

<400> 150
aaacagggcc attggcaaag ctggggtacc agtcacccag ccacgctcta gggtggtagc 60
caagaagacg gaccccgagt gggaggcaga gagacaagag gtgg atg aag cag agc 116
Met Lys Gln Ser
-45

aag cgt gas atg gtg aag aga aga cgg agc ccc gcg ctg gga gag gaa 164
Lys Arg Xaa Met Val Lys Arg Arg Arg Ser Pro Ala Leu Gly Glu Glu
-40 -35 -30

cgc ttc agt ccg agt tcc att ctg cac cca agg ctc ccc ttg gtc ctc 212
Arg Phe Ser Pro Ser Ser Ile Leu His Pro Arg Leu Pro Leu Val Leu
-25 -20 -15

ctg gga acc agg gtg ccc ctt agt ggt ggt ggc cca gga gaa ccc gac 260
Leu Gly Thr Arg Val Pro Leu Ser Gly Gly Gly Pro Gly Glu Pro Asp
-10 -5 1 5

caa ggc agg agc gcc ccc tcc tgg aag agc ctc gct tca acg cat mat 308
Gln Gly Arg Ser Ala Pro Ser Trp Lys Ser Leu Ala Ser Thr His Xaa
10 15 20

cat tcc cgg ccg gca gca ggg gcg acg cca gca agg cct gcg act cag 356
His Ser Arg Pro Ala Ala Gly Ala Thr Pro Ala Arg Pro Ala Thr Gln
25 30 35

agc cag ctt ggc ccg ttc gcc ccg ccc ctt ccc ggt gtc cgc ccc gcc 404
Ser Gln Leu Gly Pro Phe Ala Pro Pro Leu Pro Gly Val Arg Pro Ala
40 45 50

cca t 408
Pro
55



<210> 151 
<211> 166 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 69..164

<221> sig_peptide
<222> 69..122
<223> Von Heijne matrix
score 7.80000019073486
seq LCVLSLLVSFKSA/CL

<400> 151
cacattttct acttaaaagc aamgttacaa agcctgtgga attgctctga cttagaaaga 60
acttgatc atg ctt ttg gag tct cta tgt gtt ctc tct ctg ttg gtt agt 110
Met Leu Leu Glu Ser Leu Cys Val Leu Ser Leu Leu Val Ser
-15 -10 -5

ttt aaa tca gcc tgc ctc aca agg gag cct gca ttt gat tcc caa gcc 158
Phe Lys Ser Ala Cys Leu Thr Arg Glu Pro Ala Phe Asp Ser Gln Ala
1 5 10

cgc ccg gg 166
Arg Pro


<210> 152 
<211> 382 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 99..380

<221> sig_peptide
<222> 99..236
<223> Von Heijne matrix
score 7.80000019073486
seq LLYLSFAALGVVA/LR

<400> 152
ttttacacac acacatacat acacacacac agctaattga gttttaaagt aatattcttg 60
ctaatcccta ctgaattgta gcttggtgtt gtttctga atg gtt ttt gga tat tgg 116
Met Val Phe Gly Tyr Trp
-45

aag cag ccg ctg att acc ctt gca aag aaa tct gta aaa tgt gca cgt 164
Lys Gln Pro Leu Ile Thr Leu Ala Lys Lys Ser Val Lys Cys Ala Arg
-40 -35 -30 -25

gaa tgt ctg aga tgc tct ctc agg cct cta gtc ctt ctg tat ctt tcc 212
Glu Cys Leu Arg Cys Ser Leu Arg Pro Leu Val Leu Leu Tyr Leu Ser
-20 -15 -10

ttt gca gcc ctg ggt gta gta gca ctc agg agt gtt gaa tca ccc ctg 260
Phe Ala Ala Leu Gly Val Val Ala Leu Arg Ser Val Glu Ser Pro Leu
-5 1 5

gcc gag acc cac tcc tgc tgg ctc agc ctg ggc atg tgt gtg ctc cag 308
Ala Glu Thr His Ser Cys Trp Leu Ser Leu Gly Met Cys Val Leu Gln
10 15 20

tgt gaa cag cag tgg gtt cca acc cca gtc tcc ttt ctc tgt ggc ctc 356
Cys Glu Gln Gln Trp Val Pro Thr Pro Val Ser Phe Leu Cys Gly Leu
25 30 35 40

tct ggc tcc agc acc atc atc gtt ag 382
Ser Gly Ser Ser Thr Ile Ile Val
45



<210> 153 
<211> 208 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 10..207

<221> sig_peptide
<222> 10..81
<223> Von Heijne matrix
score 7.80000019073486
seq CVYVVCLVSCVLC/VV

<400> 153
tgcaatgtc atg tgt gtt gtg tgc agt gtg cat ggt gtg tgt tgt gta tat 51
Met Cys Val Val Cys Ser Val His Gly Val Cys Cys Val Tyr
-20 -15

gtg gtg tgc ctg gtg tcg tgt gtt ttg tgt gtc gtg tgt cct gtg tgt 99
Val Val Cys Leu Val Ser Cys Val Leu Cys Val Val Cys Pro Val Cys
-10 -5 1 5

tgg gtt atg tgt tgt gtg tgg tgc atc tgt gtg tgt gtg tgg tgt gtc 147
Trp Val Met Cys Cys Val Trp Cys Ile Cys Val Cys Val Trp Cys Val
10 15 20

tgt tgt atg tgt tgt gtg ttg tca tgt gtt gtg tca cat ggg ttg tgt 195
Cys Cys Met Cys Cys Val Leu Ser Cys Val Val Ser His Gly Leu Cys
25 30 35

ggt gtg tca tgg g 208
Gly Val Ser Trp
40



<210> 154 

<211> 251 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 73..249

<221> sig_peptide
<222> 73..129
<223> Von Heijne matrix
score 7.80000019073486
seq WVFLVAVLEVVQC/EI

<400> 154
agagaggagc ctcagcccta gactccaagg cctttccact tggtgatcag cactgagcac 60
agaggactca cc atg gaa ctg ggg ctg tcc tgg gtc ttc ctt gtt gct gtt 111
Met Glu Leu Gly Leu Ser Trp Val Phe Leu Val Ala Val
-15 -10

tta gaa gtt gtc cag tgt gaa att caa ctg att gac gcc ggg gga ggc 159
Leu Glu Val Val Gln Cys Glu Ile Gln Leu Ile Asp Ala Gly Gly Gly
-5 1 5 10

cac gtc cag gcg ggg ggg tca ctg aga ctc tcc tgt gtt gcc tct gac 207
His Val Gln Ala Gly Gly Ser Leu Arg Leu Ser Cys Val Ala Ser Asp
15 20 25

ttc ctg ttt aga agc tat tgg atg acc tgg gtc cgc cat ccg gg 251
Phe Leu Phe Arg Ser Tyr Trp Met Thr Trp Val Arg His Pro
30 35 40



<210> 155 

<211> 147 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 24..146

<221> sig_peptide
<222> 24..140
<223> Von Heijne matrix
score 7.80000019073486
seq ILLFLFLILFIWH/IR

<400> 155
ggattttgtc aaatgctttt tct atg tcg att ttg ctg agg gtt tta ggc ata 53
Met Ser Ile Leu Leu Arg Val Leu Gly Ile
-35 -30

aag gga tgc tgg att ttg tca aat cct ttt tct gca tgt att gag atg 101
Lys Gly Cys Trp Ile Leu Ser Asn Pro Phe Ser Ala Cys Ile Glu Met
-25 -20 -15

atc ttg tta ttt ttg ttt tta att ctg ttt ata tgg cac att cgg g 147
Ile Leu Leu Phe Leu Phe Leu Ile Leu Phe Ile Trp His Ile Arg
-10 -5 1

<210> 156 
<211> 141 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 61..141

<221> sig_peptide
<222> 61..135
<223> Von Heijne matrix
score 7.69999980926514
seq LVPILLLIGWIVG/CT

<400> 156
gctggataac aaaagaaaga ggtaagcgtg gcctgaccta gccacccacc aacaggaata 60
atg gct gaa aaa gcg ggg tct aca ttt tca cac ctt ctg gtt cct att 108
Met Ala Glu Lys Ala Gly Ser Thr Phe Ser His Leu Leu Val Pro Ile
-25 -20 -15 -10

ctt ctc ctg att ggc tgg att gtg ggc tgc acc 141
Leu Leu Leu Ile Gly Trp Ile Val Gly Cys Thr
-5 1

<210> 157
<211> 115
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 12..113

<221> sig_peptide
<222> 12..68
<223> Von Heijne matrix
score 7.69999980926514
seq RLYLWMCLAAALA/SF

<400> 157
ctcaagaagc c atg gcg gaa tcc agg ggc cgt ctg tac ctt tgg atg tgc 50
Met Ala Glu Ser Arg Gly Arg Leu Tyr Leu Trp Met Cys
-15 -10

ttg gct gct gcg ctg gca tct ttc ctg atg gga ttt atg gtg ggc tgg 98
Leu Ala Ala Ala Leu Ala Ser Phe Leu Met Gly Phe Met Val Gly Trp
-5 1 5 10

ttt att aag cct ctg gg 115
Phe Ile Lys Pro Leu
15



<210> 158 
<211> 175 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 54..173

<221> sig_peptide
<222> 54..131
<223> Von Heijne matrix
score 7.69999980926514
seq FLLLLYFFXIAVT/HP

<400> 158
caattcaaca tgagatttag tggtgacaaa tatccaaact ctatcaacct cta atg 56
Met

ctg acc tca ctg cct ttc ctc ctg ccc acc atc agc ttt ctc ctc ctc 104
Leu Thr Ser Leu Pro Phe Leu Leu Pro Thr Ile Ser Phe Leu Leu Leu
-25 -20 -15 -10

ttg tat ttt ttt cma att gct gtc acc cat ccg tca gtt ctc atc aac 152
Leu Tyr Phe Phe Xaa Ile Ala Val Thr His Pro Ser Val Leu Ile Asn
-5 1 5

ttc tct ttc tcc ttc ccc aga tc 175
Phe Ser Phe Ser Phe Pro Arg
10



<210> 159 
<211> 230 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 121..228

<221> sig_peptide
<222> 121..180
<223> Von Heijne matrix
score 7.59999990463257
seq LLFFTCGLPALHG/DS

<221> misc_feature
<222> 18
<223> n=a, g, c or t

<400> 159
aggaggggcc gtcagggngg gatacagcct ggaaggtgcg tgtggggctg ggtctcggag 60
tgggagacgt ggagtgcagg taatgcatgt ccatggtaca caaattcaca aggtttgtaa 120
atg aga aaa gac gtg agg ttc ctt ttg ttc ttt acc tgt ggc ctc cct 168
Met Arg Lys Asp Val Arg Phe Leu Leu Phe Phe Thr Cys Gly Leu Pro
-20 -15 -10 -5

gcc cta cac ggg gac tct agg gtg gaa tgt agc aaa gcc cat cca cca 216
Ala Leu His Gly Asp Ser Arg Val Glu Cys Ser Lys Ala His Pro Pro
1 5 10

gcc atg tac tac cc 230
Ala Met Tyr Tyr
15

<210> 160 
<211> 346 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 202..345

<221> sig_peptide
<222> 202..282
<223> Von Heijne matrix
score 7.59999990463257
seq WTLLSISLSVFWS/EP

<400> 160
ttcttctaca tacagctacc caactagccc acaccattta ttgaatacag agtagtcttt 60
tccctgttgg ttatttttct taactttgtt aaagatcaga tatctgtagg tgtgcagctt 120
tatttctggg ttttctgttc cgttccattg gtctatgtgt ctgtttttgt accagtacca 180
tgctgttctg gcaccagtac c atg cta ttt tgg tta cca tct cca tct gag 231
Met Leu Phe Trp Leu Pro Ser Pro Ser Glu
-25 -20

acc act tca gcc tgg act tta ttg tcc ata tca cta tca gta ttt tgg 279
Thr Thr Ser Ala Trp Thr Leu Leu Ser Ile Ser Leu Ser Val Phe Trp
-15 -10 -5

tca gag cca ttc aat aag tct cta gga agt tcc aaa cta cca tgt cat 327
Ser Glu Pro Phe Asn Lys Ser Leu Gly Ser Ser Lys Leu Pro Cys His
1 5 10 15

ttt ttt tct ata aaa cgg g 346
Phe Phe Ser Ile Lys Arg
20

<210> 161 
<211> 388 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 194..388

<221> sig_peptide
<222> 194..334
<223> Von Heijne matrix
score 7.59999990463257
seq LXLGEGLTFLCLC/QV

<221> misc_feature
<222> 352
<223> n=a, g, c or t

<400> 161
agtgagagct tagtcttggt actatttgtt tttgtttctt actgtttgtc tgtttatggt 60
tggttgcaag aaaattgtgt tgtaaattat cccttgcttt ctctattagt taatagcctt 120
ccccttctgt agtaaagtaa msagsctttt kcctgttcaa atattttagg cttgtttttt 180
gttttgattg tac atg cct gtg tgt ttt tat tcc tta att tgt ttc ttt 229
Met Pro Val Cys Phe Tyr Ser Leu Ile Cys Phe Phe
-45 -40

att tat ttc tgt ttg tta tct cca aga gaa aca ata gaa gag gtg gcc 277
Ile Tyr Phe Cys Leu Leu Ser Pro Arg Glu Thr Ile Glu Glu Val Ala
-35 -30 -25 -20

ctc ttc cag ttt tct ctg cth mtc ttg gga gag ggt ctc acc ttt ctt 325
Leu Phe Gln Phe Ser Leu Leu Xaa Leu Gly Glu Gly Leu Thr Phe Leu
-15 -10 -5

tgc ctc tgc cag gta atg acg aat aan atg caa ctg ctg ttc ttg agt 373
Cys Leu Cys Gln Val Met Thr Asn Xaa Met Gln Leu Leu Phe Leu Ser
1 5 10

ggg gta gtc tgt ggg 388
Gly Val Val Cys Gly
15



<210> 162 
<211> 235 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 172..234

<221> sig_peptide
<222> 172..210
<223> Von Heijne matrix
score 7.5
seq MAPLLLSLSCSFS/CH

<400> 162
cccctccaaa tctcatgttg agatttgatc cctaatgttg gagatggggc ctggtgggag 60
atattcggat catgagggca gatccctcac taatggcctg gtgccctccc tgtggaaatg 120
agtaagttct cactcttttg gttcacctga gagctgtttg tttaaaagag c atg gca 177
Met Ala

ccc ctc ctt ctc tct ctg tct tgc tcc ttt tct tgc cat gtg aca ctc 225
Pro Leu Leu Leu Ser Leu Ser Cys Ser Phe Ser Cys His Val Thr Leu
-10 -5 1 5

ctg ccc cgg g 235
Leu Pro Arg



<210> 163 
<211> 240 
<212> DNA 
<213> Homo sapiens 

<220>
<221> CDS
<222> 99..239

<221> sig_peptide
<222> 99..158
<223> Von Heijne matrix
score 7.5
seq LLWVLLLNLGPRA/AG

<400> 163
aaaacgaccc ggtgggtcta cagcggaagg gagggagcga aggtaggagg cagggcttgc 60
ctcactggcc accctcccaa ccccaagagc ccagcccc atg gtc ccc gcc gcc ggs 116
Met Val Pro Ala Ala Gly
-20 -15

gcg ctg ctg tgg gtc ctg ctg ctg aat ctg ggt ccc cgg gcg gcg ggg 164
Ala Leu Leu Trp Val Leu Leu Leu Asn Leu Gly Pro Arg Ala Ala Gly
-10 -5 1

gcc caa ggc ctg acc cag act ccg acc gaa atg cag cgg gtc agt tta 212
Ala Gln Gly Leu Thr Gln Thr Pro Thr Glu Met Gln Arg Val Ser Leu
5 10 15

cgc ttt ggg ggc ccc atg acc cgc agg g 240
Arg Phe Gly Gly Pro Met Thr Arg Arg
20 25

<210> 164 
<211> 195 
<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> 20..193

<221> sig_peptide
<222> 20..91
<223> Von Heijne matrix
score 7.5
seq LISELLLLRSVTS/HN

<400> 164
ttctgattat gatggtaat atg gta ttc tgg gaa ata tct gtc caa att atc 52
Met Val Phe Trp Glu Ile Ser Val Gln Ile Ile
-20 -15

ctg atc tct gaa ctc ctg ctg ttg agg tca gtc act tca cac aat acc 100
Leu Ile Ser Glu Leu Leu Leu Leu Arg Ser Val Thr Ser His Asn Thr
-10 -5 1

atg atg aga gct tta tca agc cag atg ctt agt cag agc ttt cca aga 148
Met Met Arg Ala Leu Ser Ser Gln Met Leu Ser Gln Ser Phe Pro Arg
5 10 15

ccc agc ttt ggt ttt atc agc aaa atc cat cct tcc cac ccc ccc aa 195
Pro Ser Phe Gly Phe Ile Ser Lys Ile His Pro Ser His Pro Pro
20 25 30

<210> 165 
<211> 256 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 34. .255 

<221> sig_peptide 
<222> 34. .186 
<223> Von Heijne matrix 
score 7.5 

seq WSLTFLLGMTWG/FA 

<221> misc_feature 
<222> 18 

<223> n=a, g, c or t 



<400> 165 

tatttatgtg acctgtgngg gtattttgga gtc atg ttt ttt ctg aac att gcc 54
Met Phe Phe Leu Asn Ile Ala
-50 -45
atg ttc att gtg gta atg gtg cag atc tgt ggg agg aat ggc aag aga 102
Met Phe Ile Val Val Met Val Gln Ile Cys Gly Arg Asn Gly Lys Arg
-40 -35 -30
agc aac cgg acc ctg aga gaa gaa gtg tta agg aac ctg cgc agt gtg 150
Ser Asn Arg Thr Leu Arg Glu Glu Val Leu Arg Asn Leu Arg Ser Val
-25 -20 -15
gtt agc ttg acc ttt ctg ttg ggc atg aca tgg ggt ttt gca ttc ttt 198
Val Ser Leu Thr Phe Leu Leu Gly Met Thr Trp Gly Phe Ala Phe Phe
-10 -5 1
gcc tgg gga ccc tta aat atc ccc ttc atg tac ctc ttc tcc atc ttc 246
Ala Trp Gly Pro Leu Asn Ile Pro Phe Met Tyr Leu Phe Ser Ile Phe
5 10 15 20
aat tca tta c 256
Asn Ser Leu

<210> 166
<211> 209
<212> DNA
<213> Homo sapiens



<220> 

<221> CDS 
<222> 36. .209 



<221> sig_peptide 
<222> 36. .86 
<223> Von Heijne matrix 
score 7.5

seq FLFLFLLXXLIVA/VT 



<400> 166 

cttttttdtc ckgcacaagg gatttccggg tcagg atg aac aaa cac ttc ttg 53
Met Asn Lys His Phe Leu
-15
ttc ctc ttc ctc ctt dac kgc ctc att gtg gca gtg aca tca ctt cag 101
Phe Leu Phe Leu Leu Xaa Xaa Leu Ile Val Ala Val Thr Ser Leu Gln
-10 -5 1 5
tgc ata aca tgc cac ctt cgc aca cgg aca gac cgc tgt aga aga ggc 149
Cys Ile Thr Cys His Leu Arg Thr Arg Thr Asp Arg Cys Arg Arg Gly
10 15 20
ttt ggt gdc tgt act gct cag aag ggc gag gca tgc atg ctc tta agg 197
Phe Gly Xaa Cys Thr Ala Gln Lys Gly Glu Ala Cys Met Leu Leu Arg
25 30 35
att cac cag cgc 209
Ile His Gln Arg
40





<210> 167 
<211> 184 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 44. .184 



<221> sig_peptide 
<222> 44 . . 148 
<223> Von Heijne matrix 
score 7.5 

seq LLLTSHFLGESLG/GG 



<400> 167 

taaagtcctg tgtatgacat gacatagtat ttgcgtaatt taa atg tac ata aag 55
Met Tyr Ile Lys
-35
atg gag tct gtc acc ttg tca cca gcc cca gtc ttc ccc gtc cct gca 103
Met Glu Ser Val Thr Leu Ser Pro Ala Pro Val Phe Pro Val Pro Ala
-30 -25 -20
car ctc ctt tta ctg aca tcc cat ttt cta ggc gag tcc ctt ggt gga 151
Gln Leu Leu Leu Leu Thr Ser His Phe Leu Gly Glu Ser Leu Gly Gly
-15 -10 -5 1
ggc aca ctg ctt gtc cca ctc ctc ccc cca ggg 184
Gly Thr Leu Leu Val Pro Leu Leu Pro Pro Gly
5 10

<210> 168 
<211> 218 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 97. .216 

<221> sig_peptide

<222> 97.. 177 

<223> Von Heijne matrix 

score 7.40000009536743 

seq ILLLTICAAGIXG/TR 

<400> 168 

ccttccctcc gcgcacaggc tgccggctca ccgcttgcta atggcagccg gggtctccct 60
gggacagcaa gacctccgct caggcccctc tttcga atg ckc cam gcm ctc ctg 114
Met Xaa Xaa Ala Leu Leu
-25
cga tct aga atg att cag ggc agg atc ctg ctc ctg acc atc tgc gct 162
Arg Ser Arg Met Ile Gln Gly Arg Ile Leu Leu Leu Thr Ile Cys Ala
-20 -15 -10
gcc ggc att rgt ggg act cgt cag ttt ggc tat aac ctc tct atc atc 210
Ala Gly Ile Xaa Gly Thr Arg Gln Phe Gly Tyr Asn Leu Ser Ile Ile
-5 1 5 10
aat gac cc 218
Asn Asp

<210> 169 
<211> 480 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 317. .478 

<221> sig_peptide

<222> 317. .457 

<223> Von Heijne matrix 

score 7.40000009536743 

seq SCLFSXAWLXCXC/HG 

<400> 169 

gtctcgtggg ctggtcccca gcggctccct ccccgaacag ctgctgctcc agggaggaag 60
cggcgyrrgt gmtgtccagc ttcccggtgc tgaaaaccgg agggctcgtc atccaccact 120
accatgtaag ggccatgaga agggctcatc ctggcgcasg cggacatgga ggaggactta 180
ttccagctaa ggcagctgcc ggttgtgaaa ttccgtcgca caggcgagag tgcaaggtca 240
gaggacgaca cggcttcagg agagcatgaa gtccagattg aaggggtcca cgtgggccta 300
gaggctgtgg agctgg atg atg ggg cak ctg tgc cca agg agt ttg cca atc 352
Met Met Gly Xaa Leu Cys Pro Arg Ser Leu Pro Ile
-45 -40
cca ccg atg ata ctt tca tgg tgg aag atg cag tgg aag cca ttg gct 400
Pro Pro Met Ile Leu Ser Trp Trp Lys Met Gln Trp Lys Pro Leu Ala
-35 -30 -25 -20
ttg gaa aat ttc agt gga agc tgt ctg ttc tca mtg gct tgg ctt kga 448
Leu Glu Asn Phe Ser Gly Ser Cys Leu Phe Ser Xaa Ala Trp Leu Xaa
-15 -10 -5
tgc tsa tgc cat gga gat gat gat ctc agc at 480
Cys Xaa Cys His Gly Asp Asp Asp Leu Ser
1 5

<210> 170 
<211> 280 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 135. .278 

<221> sig_peptide 

<222> 135 . . 179 

<223> Von Heijne matrix 

score 7.40000009536743 

seq LLQLLAFSFLGNS/VE 

<221> misc_feature 
<222> 104 

<223> n=a, g, c or t



<400> 170 

ttctttgggc tcgggggctc ccggagcagg gcgagagctc gcgtcgccgg aaaggaagac 60
gggaagaaag ggcaggcggc tcggcgggcg tcttctccac tccntgccgc gccccgtggc 120
tgcagggagc cggc atg ggg ctt ctc cag ttg cta gct ttc agt ttc tta 170
Met Gly Leu Leu Gln Leu Leu Ala Phe Ser Phe Leu
-15 -10 -5
ggt aat tcc gtg gaa acg gtg cgg gga ggc gga cgg act tgg gca tgg 218
Gly Asn Ser Val Glu Thr Val Arg Gly Gly Gly Arg Thr Trp Ala Trp
1 5 10
gga agg aaa acc caa aag ctg ctt gct cac ctt cgt ggg atc ctg ggg 266
Gly Arg Lys Thr Gln Lys Leu Leu Ala His Leu Arg Gly Ile Leu Gly
15 20 25
gct tgg gas agg ga 280
Ala Trp Xaa Arg
30

<210> 171 
<211> 103 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 28. .102 

<221> sig_peptide 

<222> 28. .69 

<223> Von Heijne matrix 

score 7.40000009536743 

seq LVLVHSSLSKTLS/QK 

<400> 171 

actgggatgc agaggctgca gtgagcc atg ttg gtg ctg gtg cac tcc agc ctg 54
Met Leu Val Leu Val His Ser Ser Leu
-10
agc aag acc ttg tct cag aaa aaa aaa aag ttc aca aas ccc acc agg g 103
Ser Lys Thr Leu Ser Gln Lys Lys Lys Lys Phe Thr Xaa Pro Thr Arg
-5 1 5 10

<210> 172 
<211> 218 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 73. .216 

<221> sig_peptide 

<222> 73. .129 

<223> Von Heijne matrix 

score 7.40000009536743 

seq LILVISCLLLAFE/CV 

<400> 172 

caattttgtt gatcttttca aaaaaccagc tcctggattc attaattttt tgaagggttt 60 
tttgatgtct ct atg tcc ttc agt tct gct ctg att tta gtt att tct tgc 111
Met Ser Phe Ser Ser Ala Leu Ile Leu Val Ile Ser Cys
-15 -10
ctt ctg cta gct ttt gaa tgt gtt tgc tct tgc ttt tct ggt tct ttt 159
Leu Leu Leu Ala Phe Glu Cys Val Cys Ser Cys Phe Ser Gly Ser Phe
-5 1 5 10
aat tgt gat gtt agg gtg tca att tcg gat ctt tcc tgc ttt ctc ttg 207
Asn Cys Asp Val Arg Val Ser Ile Ser Asp Leu Ser Cys Phe Leu Leu
15 20 25
tgg ggc aag gg 218
Trp Gly Lys

<210> 173 
<211> 380 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 295. .378 

<221> sig_peptide 

<222> 295. .360

<223> Von Heijne matrix 

score 7.40000009536743 

seq CLXVFLLTDRTLS/CR 

<400> 173 

tattggttat tctagttata cattagtcta aatttttttc aaagttttca acttctttgc 60
ctttggtttg aatttcctcc tgtagcttgg agtagtttga tcatctgaag ctttcttctc 120
tcaactcatc aaagtcattc tccatccagc tttgttccat tgctggtgag gaactgtgtt 180
ccttcggagg aggagaggtg ctctgctttt ttgagtttcc agtttttctg ctctgttttt 240
tccccatctt tgtggtttta tctacttttg gtctttgatg ctggtgatgt acag atg 297
Met
ggt ttt tgg tgt gga tgt cct ttc tgt ttg twa gtt ttc ctt cta aca 345
Gly Phe Trp Cys Gly Cys Pro Phe Cys Leu Xaa Val Phe Leu Leu Thr
-20 -15 -10
gac agg acc ctc agc tgc agg tct gtt gga gtt gc 380
Asp Arg Thr Leu Ser Cys Arg Ser Val Gly Val
-5 1 5



<210> 174 
<211> 139 
<212> DNA

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .139 

<221> sig_peptide 
<222> 59. .103 
<223> Von Heijne matrix 

score 7.30000019073486 

seq LLSLSLWGISTLS/ST 

<400> 174 

ataacagaat gatttacatt cctttgggta tatacccagt gatgggatat atgtgtca 58
atg gta tta ctg tct tta agt ctt tgg ggc atc tcc aca ctg tct tcc 106
Met Val Leu Leu Ser Leu Ser Leu Trp Gly Ile Ser Thr Leu Ser Ser
-15 -10 -5 1
aca aca att gaa cta att tac acc ccc atc ggg 139
Thr Thr Ile Glu Leu Ile Tyr Thr Pro Ile Gly
5 10

<210> 175
<211> 122
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 38. .121

<221> sig_peptide
<222> 38. .112
<223> Von Heijne matrix
score 7.30000019073486
seq LLHVHSFLPPVFS/TQ

<400> 175
ctacctgtcc ttgcgcacca cccttgtctg ggccttc atg gcc tct ctc ctg agt 55
Met Ala Ser Leu Leu Ser
-25 -20
ggc ttt act agc ttc tgt ctt ttg cac gtt cac tct ttc ctc cct cca 103
Gly Phe Thr Ser Phe Cys Leu Leu His Val His Ser Phe Leu Pro Pro
-15 -10 -5
gtg ttt tcc acc cag aat g 122
Val Phe Ser Thr Gln Asn
1

<210> 176
<211> 300
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 175. .300

<221> sig_peptide
<222> 175. .264
<223> Von Heijne matrix
score 7.30000019073486
seq AILLXXWEAGSEA/VR

<221> misc_feature
<222> 51. .52,63,239
<223> n=a, g, c or t



<400> 176 

aaaaactcta aaagaaggac gcattttagg taagatctag tggctagatc nncagggtgg 60 
gcnkcgttct tgtggaaatc agtcaagaaa gatcggattc gcggttattt atgcaaatca 120 
tctgggtgga ttgtgtacgg agttaaactg cgccttctgg accgggtctg aaca atg 177 

Met
-30
gag act gcg cta saa tka acg cca cag aaa agg caa gtt atg ttt ctt 225
Glu Thr Ala Leu Xaa Xaa Thr Pro Gln Lys Arg Gln Val Met Phe Leu
-25 -20 -15
gct ata ttg ttg cnt twg tgg gag gct ggc tct gag gca gth agg tat 273
Ala Ile Leu Leu Xaa Xaa Trp Glu Ala Gly Ser Glu Ala Val Arg Tyr
-10 -5 1
tcc ata cca gaa gaa aca gaa agt ggc 300
Ser Ile Pro Glu Glu Thr Glu Ser Gly
5 10



<210> 177 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 268. .465



<221> sig_peptide

<222> 268. .372 

<223> Von Heijne matrix 

score 7.30000019073486 
seq LDLLGSSSPPTSA/SQ 



<400> 177 

cttaaacttt attatgttgk kttcacaaag agcagccttt gttgactttg aaatcattgc 60
ttcagtattc tagaaaatct tgtttttgtt aaacatgggc agtaacttac tatttttgta 120
tagttgttgt wcatcktacc cccaccctgt tttaaaaata aaaagtagtt gtcagattac 180
tttggcttta gaagtacctt ttcacttgcc ttagaatctt cattactttg agcctacact 240
ccacctctta ttggaacttc atgaaga atg atg ttg gat ttc gct ctg tcg ccc 294
Met Met Leu Asp Phe Ala Leu Ser Pro
-35 -30
agg cta gag cgc agt ggt ctg atc atg gct tgc tgt acc ctt gac ctc 342
Arg Leu Glu Arg Ser Gly Leu Ile Met Ala Cys Cys Thr Leu Asp Leu
-25 -20 -15
ctg ggt tca agc agt cct ccc acc tca gcc tcc cag gtg gct ggg act 390
Leu Gly Ser Ser Ser Pro Pro Thr Ser Ala Ser Gln Val Ala Gly Thr
-10 -5 1 5
ggg cat gtg cca cca cac cca gct agt ttt ttt tac ttt ktt gta wga 438
Gly His Val Pro Pro His Pro Ala Ser Phe Phe Tyr Phe Xaa Val Xaa
10 15 20
cag gtc tac tat gtt tcg cag ctg atc t 466
Gln Val Tyr Tyr Val Ser Gln Leu Ile
25 30



<210> 178 
<211> 222 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS
<222> 30. .221 

<221> sig_peptide 

<222> 30. .95 

<223> Von Heijne matrix 

score 7.19999980926514 
seq QVFFLVFPDGVRP/QP

<400> 178
acgtcggacc cggaggccct gaatgcccc atg cgc acc cca cag ctc gcg ctc 53
Met Arg Thr Pro Gln Leu Ala Leu
-20 -15
ctg caa gtg ttc ttt ctg gtg ttc ccc gat ggc gtc cgg cct cag ccc 101
Leu Gln Val Phe Phe Leu Val Phe Pro Asp Gly Val Arg Pro Gln Pro
-10 -5 1
tct tcc tcc cca tca ggg gca gtg ccc acg tct ttg gag ctg cag cga 149
Ser Ser Ser Pro Ser Gly Ala Val Pro Thr Ser Leu Glu Leu Gln Arg
5 10 15
ggg acg gat ggc gga acc ctc cag tcc cct tca gag gcg act gca act 197
Gly Thr Asp Gly Gly Thr Leu Gln Ser Pro Ser Glu Ala Thr Ala Thr
20 25 30
cgc ccg gcc gtg ccc gga ctc cgg g 222
Arg Pro Ala Val Pro Gly Leu Arg
35 40

<210> 179 
<211> 171 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 33 . .170 

<221> sig_peptide 

<222> 33 . .95 

<223> Von Heijne matrix 

score 7.19999980926514 

seq SWPLLAAVSGLRG/LE 

<400> 179 

ccttttgcct tcaaccttcg agccgccacg ta atg cca cgt ccc cgc gca tgc 53
Met Pro Arg Pro Arg Ala Cys
-20 -15
gca tct tgg ccg ctg ctg gcg gct gtt tcc ggg ctt aga ggg ctg gag 101
Ala Ser Trp Pro Leu Leu Ala Ala Val Ser Gly Leu Arg Gly Leu Glu
-10 -5 1
tgg ccg ccg agt tgg agg cgg gtg gtg gca gca gta gga gtg tgt aga 149
Trp Pro Pro Ser Trp Arg Arg Val Val Ala Ala Val Gly Val Cys Arg
5 10 15
gtg cgg gat tgg ggg ccc cgg g 171
Val Arg Asp Trp Gly Pro Arg
20 25

<210> 180 
<211> 245 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 177. .245 

<221> sig_peptide

<222> 177. .227 

<223> Von Heijne matrix 

score 7.19999980926514 

seq FLLLISVLTVIWF/WK

<400> 180 

tgtaattttc cttgccaaaa agcttagttt catcttttat aaatatccta taatgccaag 60 
ttgattgcat ggtcagagtg aatctgtgct gtacccawat tcagtagcct tctcctatcc 120 
aacaaagbgt tttgtaaata ggaggtaaat gaatgagtgg atggatggag ggatga atg 179 

Met 

aat gga att ttc ttg ctc ttg atc tct gtc tta aca gtg att tgg ttt 227
Asn Gly Ile Phe Leu Leu Leu Ile Ser Val Leu Thr Val Ile Trp Phe

-15 -10 -5 

tgg aag aca cac ccg ggg 245 
Trp Lys Thr His Pro Gly 
1 5 

<210> 181 
<211> 241 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 160. .240 

<221> sig_peptide 

<222> 160. .213 

<223> Von Heijne matrix 

score 7.19999980926514 

seq XLLCIIXLYLIRG/SE 

<400> 181 

gttgactttt ctctctgctg aggcagaaaa atgcttccat agtccatgca gcaatgttta 60
aaacaaggga tttcgttccc ccctcvcctt ttgtgtaggc tggttaataa actctgtgtt 120
tywtagcatt gtcgtgaawa ttcagagtgc tccctgcga atg gtt ttc cta gta 174
Met Val Phe Leu Val
-15
kct ctg ttg tgt atc att kct ctt tat ttg att cgt ggt tct gag tgg 222
Xaa Leu Leu Cys Ile Ile Xaa Leu Tyr Leu Ile Arg Gly Ser Glu Trp
-10 -5 1
amc cta cca ccg aac tgg g 241
Xaa Leu Pro Pro Asn Trp
5

<210> 182 
<211> 263 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 103 . .261 

<221> sig_peptide

<222> 103. .156 

<223> Von Heijne matrix 

score 7.19999980926514 

seq LFFLLRIALASWA/LF 



<400> 182 
gggttatcta acctgttcca ttgttccgtg tatcagtttc tgtaccgata ccatgctgtt 60
ttggttactg tagtcttgta gtatagttta aagtcagata gc atg atg act cta 114
Met Met Thr Leu
-15
gct ttg ttc ttt ttg ctt agg att gct ttg gct agt tgg gct ctc ttt 162
Ala Leu Phe Phe Leu Leu Arg Ile Ala Leu Ala Ser Trp Ala Leu Phe
-10 -5 1
tgg atc cat atg aat ttt aga aga gct ttt ttc cac tta cgg tgg ttt 210
Trp Ile His Met Asn Phe Arg Arg Ala Phe Phe His Leu Arg Trp Phe
5 10 15
gat atc aat agc act gaa tct gta aat tgc ttt ggg cag tat ggc cta 258
Asp Ile Asn Ser Thr Glu Ser Val Asn Cys Phe Gly Gln Tyr Gly Leu
20 25 30
gcg gg 263
Ala
35

<210> 183
<211> 170
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 60. .170

<221> sig_peptide
<222> 60. .146
<223> Von Heijne matrix
score 7.09999990463257
seq SLLVFCLNDLSNA/VX

<400> 183
ttccatgtgg agatgrraag aatatatatt ctgtggttat tgggtagagt gttctatag 59
atg tct att agg tct aat tgg tct agt gtc gaa tct aag tct aga att 107
Met Ser Ile Arg Ser Asn Trp Ser Ser Val Glu Ser Lys Ser Arg Ile
-25 -20 -15
tct tta tta gtt ttc tgc ctc aat gat ctw tck aat gcw gtc arg wgg 155
Ser Leu Leu Val Phe Cys Leu Asn Asp Leu Ser Asn Ala Val Xaa Xaa
-10 -5 1
ggm att gaa rtc ccc 170
Gly Ile Glu Xaa Pro
5

<210> 184
<211> 443
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 83. .442

<221> sig_peptide
<222> 83. .130
<223> Von Heijne matrix
score 7.09999990463257
seq IPLFLGVLAYCTG/SV

<400> 184
ctttccagca aggggataag agaggcctgg aagaacctgc ccagcctggg cctcaggaag 60
cagcatcgga ggtgcctcag cc atg gca tgg atc cct ctc ttc ctc ggc gtc 112
Met Ala Trp Ile Pro Leu Phe Leu Gly Val
-15 -10
ctt gct tac tgc aca gga tcc gtg gcc tcc tat gag ctg act cac cca 160
Leu Ala Tyr Cys Thr Gly Ser Val Ala Ser Tyr Glu Leu Thr His Pro
-5 1 5 10
ccc tca gtg tcc gtg tcc cca gga cag aca gcc agc atc acc tgc tct 208
Pro Ser Val Ser Val Ser Pro Gly Gln Thr Ala Ser Ile Thr Cys Ser
15 20 25
gga gat aaa ttg ggg gat aaa tat gct tgc tgg tat cag cag aag cca 256
Gly Asp Lys Leu Gly Asp Lys Tyr Ala Cys Trp Tyr Gln Gln Lys Pro
30 35 40
ggc cag tcc cct gtg ctg gtc atc tat caa gat agc aag cgg ccc tca 304
Gly Gln Ser Pro Val Leu Val Ile Tyr Gln Asp Ser Lys Arg Pro Ser
45 50 55
ggg atc cct gag cga ttc tct ggc tcc aac tct ggg aac aca gcc act 352
Gly Ile Pro Glu Arg Phe Ser Gly Ser Asn Ser Gly Asn Thr Ala Thr
60 65 70
ctg acc atc agc ggg acc cag gct atg gat gag gct gac tat tac tgt 400
Leu Thr Ile Ser Gly Thr Gln Ala Met Asp Glu Ala Asp Tyr Tyr Cys
75 80 85 90
cag gcg tgg gac agc agc act gtg gta ttc ggc gga ggg acc a 443
Gln Ala Trp Asp Ser Ser Thr Val Val Phe Gly Gly Gly Thr
95 100



<210> 185 
<211> 427 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 332. .427



<221> sig_peptide

<222> 332. .418 

<223> Von Heijne matrix 

score 7.09999990463257 
seq FCFXLCFGRSSLC/CR



<400> 185 

taagtttata yhtctgaatc tgaaatcaga atatatatat ttaatttttc aattttaaaa 60 
atgttaccct gtgtgagaca aaacaaaaca gtgactagaa ccctccttgt gggctaaatt 120
tgagtttgct tcttcataat gttttaaatg cttcacaaac atttttcttt ggtatattga 180
gcaaaatgaa ttgaagtata tttactgagt gatgattatt gaggaaaaac tcaaagatct 240
gctgtaagca ctagagttga aggactagcc caacagctcc tcaggcacct ttgggtatat 300
tgagttgccc cccctgactt tgaacacatc t atg gtc tgt gtc atc ttc aaa 352
Met Val Cys Val Ile Phe Lys
-25
gag ctc atg gaa ttt gaa ttc cct ggg ttt tgt ttt tgh ctt tgt ttt 400
Glu Leu Met Glu Phe Glu Phe Pro Gly Phe Cys Phe Xaa Leu Cys Phe
-20 -15 -10
gga cgg agc tcg ctc tgt tgc cga rac 427
Gly Arg Ser Ser Leu Cys Cys Arg Xaa
-5 1



<210> 186 
<211> 365 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 130. .363 



<221> sig_peptide
<222> 130. .219
<223> Von Heijne matrix
score 7.09999990463257
seq SCLALXTLAVVYA/AL

<400> 186
aacgagtctt tgggaacgtg gtccacccag ggatgtaaaa ctgtgcttac cgatgcatcc 60
catacgaaat gcttatgtga tcgtctctct accttcgcca ttttggctca gcaacctaga 120
gaaataatc atg gaa tcc tct ggc aca cct tca gtt acc cta ata gta ggc 171
Met Glu Ser Ser Gly Thr Pro Ser Val Thr Leu Ile Val Gly
-30 -25 -20
agt ggt ctt tct tgc ttg gcc ttg atb acc cta gca gtt gtc tat gca 219
Ser Gly Leu Ser Cys Leu Ala Leu Xaa Thr Leu Ala Val Val Tyr Ala
-15 -10 -5
gca tta tgg mgg tac ata cgc tct gag aga tcc ata ata cta att aac 267
Ala Leu Trp Arg Tyr Ile Arg Ser Glu Arg Ser Ile Ile Leu Ile Asn
1 5 10 15
ttc tgc ctg tct atc atc tca tcc aat atc ctc ata ctg gtt gga cag 315
Phe Cys Leu Ser Ile Ile Ser Ser Asn Ile Leu Ile Leu Val Gly Gln
20 25 30
act cag aca cat aat aaa gag tat ctg cac aac cac cac tgc att ttt 363
Thr Gln Thr His Asn Lys Glu Tyr Leu His Asn His His Cys Ile Phe
35 40 45
gc 365



<210> 187 
<211> 260 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 86. .259 



<221> sig_peptide 

<222> 86. .178 

<223> Von Heijne matrix 

score 7.09999990463257 
seq LXFLASSFCFGEA/DS 

<221> misc_feature 
<222> 143 

<223> n=a, g, c or t 



<400> 187 

ttttggaaca gggtaggcat tttgtttatt gtttgcttgc ttctaggtgt tttcgccatc 60
agggtgtatt ggaggctgac actta atg ggt gtg tgt tgc gcc cag aac tgc 112
Met Gly Val Cys Cys Ala Gln Asn Cys
-30 -25
tcg gtg tcg ggg ktc waa agr aat gcg ctg ntg ttc ttg gct tca agt 160
Ser Val Ser Gly Xaa Xaa Arg Asn Ala Leu Xaa Phe Leu Ala Ser Ser
-20 -15 -10
ttc tgc ttt gga gaa gca gat tca gga agt agg tgt tgc tta aaa ata 208
Phe Cys Phe Gly Glu Ala Asp Ser Gly Ser Arg Cys Cys Leu Lys Ile
-5 1 5 10
att ctt ggt ttt tat cta atc aga tat tca ttg att acc tac cag gtg 256
Ile Leu Gly Phe Tyr Leu Ile Arg Tyr Ser Leu Ile Thr Tyr Gln Val
15 20 25
cgt g 260
Arg



<210> 188 
<211> 172 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 52. .171 



<221> sig_peptide 

<222> 52. .105

<223> Von Heijne matrix 

score 7.09999990463257 
seq LFFFLKWSHPGWS/AT 

<221> misc_feature 
<222> 112 

<223> n=a, g, c or t 



<400> 188 

ttaatggaat atattatagt acactagcat gctggaaaga atgaaaataa t atg aaa 57 

Met Lys 

att ctt tac ctt ttt ttc ttt ttg aaa tgg agt cac cca ggc tgg agt 105
Ile Leu Tyr Leu Phe Phe Phe Leu Lys Trp Ser His Pro Gly Trp Ser
-15 -10 -5
gca acg ncg tgg tct tgg cac act gca acc tcc gcc tcc ctg att caa 153
Ala Thr Xaa Trp Ser Trp His Thr Ala Thr Ser Ala Ser Leu Ile Gln
1 5 10 15
gtg att ctc ccg cct tgg g 172
Val Ile Leu Pro Pro Trp
20



<210> 189 
<211> 150 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 47. .148 



<221> sig_peptide 

<222> 47. .124 

<223> Von Heijne matrix 

score 7.09999990463257 
seq LFLSGCFLFLSXC/XI 



<400> 189 

tatcacwtct aagagatttc tggtgaaact tgtggatttt ctatac atg aca cca 55 

Met Thr Pro 
-25 

tgt ttt ctg caa atg gac aat ttg act cct ctt ttc cta tct gga tgc 103
Cys Phe Leu Gln Met Asp Asn Leu Thr Pro Leu Phe Leu Ser Gly Cys
-20 -15 -10
ttt tta ttt ctc tct cwt tgc wtg att tat ttg gct agg att ttg gg 150
Phe Leu Phe Leu Ser Xaa Cys Xaa Ile Tyr Leu Ala Arg Ile Leu
-5 1 5



<210> 190 
<211> 339 
<212> DNA 
<213> Homo sapiens 






<220> 
<221> CDS 
<222> 195. .338 

<221> sig_peptide 
<222> 195. .314
<223> Von Heijne matrix 
score 7 

seq ITCKLCLCEQSXG/QD 



<400> 190 

agtcttgcaa agtgtaaagc tgtcagccgc agagcacgga ggaaagacgg agagaatgga 60 
agagctcctg tccggtgtgc cagcagcccg gactggcggt gagcgcgagg gaggctackg 120 
agaagcccgg cgacggagga acgcaggtct gctgccaggg attgaggaga ctgaagaacg 180 
ctgaagacag gctg atg ggc tca gct ggt agg ctc cac tat ctc gsc atg 230
Met Gly Ser Ala Gly Arg Leu His Tyr Leu Xaa Met
-40 -35 -30
act gct gaa aat ccc act cct gga gac ctg gct ccg kcc ccc ctc atc 278
Thr Ala Glu Asn Pro Thr Pro Gly Asp Leu Ala Pro Xaa Pro Leu Ile
-25 -20 -15
act tgc aaa ctc tgc ctg tgt gag cag tct crt gga caa gat gac cac 326
Thr Cys Lys Leu Cys Leu Cys Glu Gln Ser Xaa Gly Gln Asp Asp His
-10 -5 1
act cca gga atg c 339
Thr Pro Gly Met
5



<210> 191 
<211> 359 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 96. .359 



<221> sig_peptide
<222> 96. .242 
<223> Von Heijne matrix 
score 7 

seq VTVLLSAAPCLLS/CF 

<221> misc_feature
<222> 340 

<223> n=a, g, c or t 



<400> 191 

tacaagagtt tttgctgaaa gttttaagtt gataagatgc agagaattgg gggaatgtat 60 
aataaatcag gtttcattgt tatattattt accac atg aat cac ctt cct cct 113
Met Asn His Leu Pro Pro
-45
aac cat tat agg mgc cat gtg ttc aca tgt cat gtg gac cag tat tta 161
Asn His Tyr Arg Xaa His Val Phe Thr Cys His Val Asp Gln Tyr Leu
-40 -35 -30
act gtg gaa acc gcg ggt ggc atg gag aag gag gca gtg tcc gtg act 209
Thr Val Glu Thr Ala Gly Gly Met Glu Lys Glu Ala Val Ser Val Thr
-25 -20 -15
gtg ctg ctc tcc gca gcc ccc tgc ctg ctg tcc tgt ttc ctc ggc tcc 257
Val Leu Leu Ser Ala Ala Pro Cys Leu Leu Ser Cys Phe Leu Gly Ser
-10 -5 1 5
tcg gtg tct gga ctg gcg ttc tgg gtt tcc cag cag aaa act aaa ggg 305
Ser Val Ser Gly Leu Ala Phe Trp Val Ser Gln Gln Lys Thr Lys Gly
10 15 20
cca gag agg tgt aaa aac aca cac cac tbg gca gnt aat aat ttc ccc 353
Pro Glu Arg Cys Lys Asn Thr His His Xaa Ala Xaa Asn Asn Phe Pro
25 30 35
gcg agg 359
Ala Arg



<210> 192 
<211> 264 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 138. .263 



<221> sig_peptide 
<222> 138. .257 
<223> Von Heijne matrix 
score 7 

seq FLFMLPLWCSIGT/CT 



<400> 192 

ttgagcttaa ggccaggtat atgggctcac acttgtaatc tcagtgcttt gggaggctga 60 
gggaaaagga tagcttgagt ccaggagttc gagatcatcc tgggcaacat agcaagatcc 120 
tgtctctaca aaaccta atg aac aaa att aaa gaa aac aca cac aca cac 170
Met Asn Lys Ile Lys Glu Asn Thr His Thr His
-40 -35 -30
aca cac aca cac aca cac aaa aac aac acc aaa cta gtg tca aac cta 218
Thr His Thr His Thr His Lys Asn Asn Thr Lys Leu Val Ser Asn Leu
-25 -20 -15
ttc ctt ttt atg tta cct ctc tgg tgc tcc att ggc act tgc aca g 264
Phe Leu Phe Met Leu Pro Leu Trp Cys Ser Ile Gly Thr Cys Thr
-10 -5 1



<210> 193 
<211> 301 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 147. .299 



<221> sig_peptide 
<222> 147. .272 
<223> Von Heijne matrix 
score 7 

seq LFLYSLFTENVLA/HP 



<400> 193 

tgtattgttt mmmttattta ctagtatgca gatctggttt tcattctttt catattgaat 60 
ttcgttatgg gtagaatcat ttgcaaacat ttctagacat ttttaaagat ctatttaatt 120 
tgtttaagaa tggaaaacat aaaata atg cat gat tct tca ggc aag aat aat 173
Met His Asp Ser Ser Gly Lys Asn Asn
-40 -35
ttc aga aag ata cct gtt gta aat tta att tat ctc tat gta gac ata 221
Phe Arg Lys Ile Pro Val Val Asn Leu Ile Tyr Leu Tyr Val Asp Ile
-30 -25 -20
cat ata cat aaa tta ttt tta tat agt ctc ttt aca gaa aat gta ttg 269
His Ile His Lys Leu Phe Leu Tyr Ser Leu Phe Thr Glu Asn Val Leu
-15 -10 -5
gca cat cct tgc att gtt cta cgc cgc cta tg 301
Ala His Pro Cys Ile Val Leu Arg Arg Leu
1 5



<210> 194 
<211> 215 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 105. .215 



<221> sig_peptide
<222> 105. .203 
<223> Von Heijne matrix 
score 7 

seq LFFLFVGPFSCLG/SY 



<400> 194 

gctctgactg cagcctccca gggaatgcgc ggccgaggga atgcgcgcag tcacaggccc 60 
tgggagtgag ctggtgcccg gcgacctggc acccgcgcct ggat atg ggg cgt cta 116
Met Gly Arg Leu
-30
cat cgt ccc agg agc agc acc agc tac agg aac ctg ccg cat ctg ttt 164
His Arg Pro Arg Ser Ser Thr Ser Tyr Arg Asn Leu Pro His Leu Phe
-25 -20 -15
ctg ttt ttc ctc ttc gtg gga ccc ttc agc tgc ctc ggg agt tac agc 212
Leu Phe Phe Leu Phe Val Gly Pro Phe Ser Cys Leu Gly Ser Tyr Ser
-10 -5 1
cgg 215
Arg



<210> 195 
<211> 209 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 78. .209 



<221> sig_peptide
<222> 78. .158 
<223> Von Heijne matrix 
score 7 

seq RLLLLLLLXLPLP/PP 

<221> misc_feature 

<222> 73. .74 

<223> n=a, g, c or t 



<400> 195 

tcattcactg attagatcca gcgctgagag gcagcactgc tccttctctc acgccaactg 60
agtctcttga tcnntac atg caa tcc cag gca gct cgc gaa cac aaa ccc 110
Met Gln Ser Gln Ala Ala Arg Glu His Lys Pro
-25 -20
ggg g^c agc cgc cta ctg ctg ctg ctg ctg ctg cwg ctg ccg ctg cct 158
Gly Xaa Ser Arg Leu Leu Leu Leu Leu Leu Leu Xaa Leu Pro Leu Pro
-15 -10 -5
ccg ccg gkv ctg cga acc cgg gdy ttt tca wgc acc aca ctc acc gcm 206
Pro Pro Xaa Leu Arg Thr Arg Xaa Phe Ser Xaa Thr Thr Leu Thr Ala
1 5 10 15
ggg 209
Gly

<210> 196 
<211> 363
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 287. .361 



<221> sig_peptide 
<222> 287. .331 
<223> Von Heijne matrix 
score 7 

seq LWSLACLSPPAVQ/LG 



<400> 196 

ttacattgta atatataaat aattatacaa ctcaccataa cgtagaatca gtgggagccc 60 
tgagcttgtt ttcctgcaac tagatggtcc caactagacc aggtgatggg agacaatgac 120 
agatcattag gcattagatt atcataagga gcatacaacc tagatccctt gcatgtgcag 180 
ttaataatag gttttgcact tctatgagga tctaatgcgg cctctgatct gacaaggggc 240 
ggastcaggc agtaatggga gcaatgggga gcggttttca atacag atg agg ctt 295
Met Arg Leu
-15
tgg tca ctt gcc tgc ctt tca cct cct gct gtg cag ctt ggt tcc caa 343
Trp Ser Leu Ala Cys Leu Ser Pro Pro Ala Val Gln Leu Gly Ser Gln
-10 -5 1
cag gcc acg gac tgg tgg tc 363
Gln Ala Thr Asp Trp Trp
5 10

<210> 197 
<211> 155 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 58. .153 

<221> sig_peptide 
<222> 58. .132 
<223> Von Heijne matrix 
score 7 

seq IFSFFFFITLVRG/SI 



<400> 197 

tagtgttatt catagtagta tctgaagacc ttttgtattc ttgtgggatc agttgta 57
atg tca cct ttg ttt att ctg att gtg ctt att tgg atc ttc tct ttc 105
Met Ser Pro Leu Phe Ile Leu Ile Val Leu Ile Trp Ile Phe Ser Phe
-25 -20 -15 -10
ttt ttc ttt att act cta gtt agg ggg tct atc aat ctt ttt ttt ttt 153
Phe Phe Phe Ile Thr Leu Val Arg Gly Ser Ile Asn Leu Phe Phe Phe
-5 1 5
tt 155



<210> 198 
<211> 135 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS 
<222> 60. .134 

<221> sig_peptide 
<222> 60. .125 
<223> Von Heijne matrix 
score 7 

seq STFLFFLFFSVFC/FF 
<400> 198 

ttgcctctta aaaggccaca cttcttaata ctatcaaatt ggctattaag tttcaacaa 59 
atg aat ttg ggg gga cat tca gat cat agc act ttt ctt ttc ttt ctt 107
Met Asn Leu Gly Gly His Ser Asp His Ser Thr Phe Leu Phe Phe Leu

-20 -15 -10 

ttt ttt tct gtt ttt tgt ttt ttt ttt t 135 
Phe Phe Ser Val Phe Cys Phe Phe Phe 
-5 1 

<210> 199 
<211> 320 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 46. .318 

<221> sig_peptide 

<222> 46. .108 

<223> Von Heijne matrix 

score 6.90000009536743

seq VTFLLALVGAVLY/LY 

<221> misc_feature 
<222> 9 

<223> n=a, g, c or t



<400> 199 

gctggagcng ccgatccgag acgtggcthc ctgggcggca gaacc atg ttg gac ttc 57
Met Leu Asp Phe
-20
gcg atc ttc gcc gtt acc ttc ttg ctg gcg ttg gtg gga gcc gtg ctc 105
Ala Ile Phe Ala Val Thr Phe Leu Leu Ala Leu Val Gly Ala Val Leu
-15 -10 -5
tac ctc tat ccg gct tcc aga caa gct gca gga att cca ggg att act 153
Tyr Leu Tyr Pro Ala Ser Arg Gln Ala Ala Gly Ile Pro Gly Ile Thr
1 5 10 15
cca act gaa gaa aaa gat ggt aat ctt cca gat att gtg aat agt gga 201
Pro Thr Glu Glu Lys Asp Gly Asn Leu Pro Asp Ile Val Asn Ser Gly
20 25 30
agt ttg cat gag tbc ctg gtt aat ttg cat gag aga tat ggg cct gtg 249
Ser Leu His Glu Xaa Leu Val Asn Leu His Glu Arg Tyr Gly Pro Val
35 40 45
gtc tcc ttc tgg ttt ggc agg cgc ctc gtg gtt agt ttg ggc act gtt 297
Val Ser Phe Trp Phe Gly Arg Arg Leu Val Val Ser Leu Gly Thr Val
50 55 60
gat gta ctg aag cag cat cgg gg 320
Asp Val Leu Lys Gln His Arg
65 70

<210> 200
<211> 125 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 40. .123 

<221> sig_peptide 

<222> 40. .93 

<223> Von Heijne matrix 

score 6.90000009536743 

seq LELLGSSSPPISA/SQ 

<400> 200 

cttcctcagt cacccaggct ggagtacagt ggcataatc atg gct cac tgc agc 54
Met Ala His Cys Ser
-15
tta gaa ctc ttg ggc tca agc agt cct ccc atc tca gcc tcc caa agc 102
Leu Glu Leu Leu Gly Ser Ser Ser Pro Pro Ile Ser Ala Ser Gln Ser
-10 -5 1
act gga att aca agc gtg agc ca 125
Thr Gly Ile Thr Ser Val Ser
5 10

<210> 201 
<211> 210 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78. .209 

<221> sig_peptide

<222> 78. . 128 

<223> Von Heijne matrix 

score 6.90000009536743 

seq LLLLSLSLFLFPW/RQ 

<400> 201 

tcaggttttc ctccttcccg ggtgctctga agtttcacca tgaatcacct tgcaggggct 60
ctttttattt tttattg atg ccc agc cag ttg ttg ttg ttg tct ctt tct 110
Met Pro Ser Gln Leu Leu Leu Leu Ser Leu Ser
-15 -10
ctc ttt ttg ttt ttt tgg aga cag agt ctc gtt ttg tgg ccc agg ctg 158
Leu Phe Leu Phe Phe Trp Arg Gln Ser Leu Val Leu Trp Pro Arg Leu
-5 1 5 10
gag tgc agt tgt gtc att gcg gct cac tgc agc ctg acc tcc cag gct 206
Glu Cys Ser Cys Val Ile Ala Ala His Cys Ser Leu Thr Ser Gln Ala
15 20 25
cgg g 210
Arg

<210> 202 
<211> 338 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 89. .337 

<221> sig_peptide

<222> 89. .226 

<223> Von Heijne matrix 

score 6.90000009536743 
seq CLFCCXFISSCNS/VF 

<221> misc_feature
<222> 291 

<223> n=a, g, c or t 



<400> 202 

aattataata atatactaaa atatgtacga atatatacta ataattagta tataatgaat 60 
cagtataaaa tatataatat acactaat atg tat act aat aaa tat aca cta 112
Met Tyr Thr Asn Lys Tyr Thr Leu
-45 -40
ata tat aac ata cta ata tat aat ata tgt btk drg tat atg tgg ttg 160
Ile Tyr Asn Ile Leu Ile Tyr Asn Ile Cys Xaa Xaa Tyr Met Trp Leu
-35 -30 -25
ata ctc att tat atg tac cta cat att tgc ctc ttt tgt tgc wct ttt 208
Ile Leu Ile Tyr Met Tyr Leu His Ile Cys Leu Phe Cys Cys Xaa Phe
-20 -15 -10
att tct tcc tgc aat tct gtg ttt ccc tgt gtg att atb ttt ctt ctg 256
Ile Ser Ser Cys Asn Ser Val Phe Pro Cys Val Ile Xaa Phe Leu Leu
-5 1 5 10
cct gaa gaa ctt ctt twt gtd twt ctd wdw dtg tnt tty wtt gtg aga 304
Pro Glu Glu Leu Leu Xaa Val Xaa Leu Xaa Xaa Xaa Phe Xaa Val Arg
15 20 25
tgg agt ctc amt cwg tcg tcc agg ctg gag tgc a 338
Trp Ser Leu Xaa Xaa Ser Ser Arg Leu Glu Cys
30 35



<210> 203 
<211> 188 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 84 . . IBS 

<221> sig_peptide 

<222> 84 . .176 

<223> Von Heijne matrix 

score 6.90000009536743 

seq LWSLIQAVHICLG/RK 

<400> 203 

tattctctga ttckctgtct tggaatgcat ttaaaatctc tgcctcgatt ctgacctacc 60 
tggcatggga acaagaattt aca atg tta ctc acc cac aat gaa gat tac atg 113 

Met Leu Leu Thr His Asn Glu Asp Tyr Met 
-30 -25 
cct ggc aat ttd grc ttw ard daw ttg tgg agc tta att cag gct gtt 161 
Pro Gly Asn Xaa Xaa Xaa Xaa Xaa Leu Trp Ser Leu Ile Gln Ala Val 

-20 -15 -10 

cat atc tgc cta ggc agg aaa aaa aaa 188 
His Ile Cys Leu Gly Arg Lys Lys Lys 
-5 1 

<210> 204 
<211> 347 
<212> DNA 



wo 99/53051 

<213> Homo sapiens 
<220> 
<221> CDS 
<222> 81. .347 



<221> sig_peptide 

<222> 81. .137 

<223> Von Heijne matrix 

score 6.90000009536743 
seq WVFLVAIIKGVQC/QV 



<400> 204 

agctctggga gaagagcccc agccccagaa ttcccaggag tttccattcg gtgatcagca 60 
ctgaacacag aggactcacc atg gag ttt ggg ctg agc tgg gtt ttc ctt gtt 113 
Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val 
-15 -10 
gct att ata aaa ggt gtc cag tgt cag gtg caa ctg gtg gag tct ggg 161 
Ala Ile Ile Lys Gly Val Gln Cys Gln Val Gln Leu Val Glu Ser Gly 
-5 1 5 
ggg ggc ttg gtc aag cct gga ggg tcc cta aga ctc tcc tgt gca gcc 209 
Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala 
10 15 20 
tct gga ttc acc ttc agt gay tac waw atr act kgg att cgc mag gcc 257 
Ser Gly Phe Thr Phe Ser Asp Tyr Xaa Xaa Thr Xaa Ile Arg Xaa Ala 
25 30 35 40 
cma ggg aag ggs ytg rak tgg att yca tam atw acg act agt ggg aat 305 
Xaa Gly Lys Gly Leu Xaa Trp Ile Xaa Xaa Ile Thr Thr Ser Gly Asn 
45 50 55 
acc gca awy tac gca gwc tct gta aag gsc cga ttc acc atc 347 
Thr Ala Xaa Tyr Ala Xaa Ser Val Lys Xaa Arg Phe Thr Ile 
60 65 70 



<210> 205 
<211> 440 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 276. .440 



<221> sig_peptide 

<222> 276. .326 

<223> Von Heijne matrix 

score 6.90000009536743 
seq FLFVCLXFDESCS/VT 



<400> 205 

cagtaaattt ttttgwwcat tggttccttc dkcttggact ctctgttagc acacctgatc 60 
agagttggcc gtgttgtaat tctttccctc tctgctgcaa tgttgtttac ttctacctgs 120 
caactawkct ttatactttc cttttttgcc atgaagggaa tacatttttt ccttcttgtt 180 
gggctataca gtgatcctca tcaacaaatt atcaaagaac tgtatgagga aaaggtctct 240 
ttttttaaaa gtgaatcagg gctggggagt tagga atg aag agg ttt ttt ttg 293 
Met Lys Arg Phe Phe Leu 
-15 
ttt gtt tgt ttg tww ttt gac gag tct tgc tct gtc acc agg ctg ggg 341 
Phe Val Cys Leu Xaa Phe Asp Glu Ser Cys Ser Val Thr Arg Leu Gly 
-10 -5 1 5 
tgc tgt ggc gcg atc tca gcc cac tgc aam ctc cga ctc cct ggt tca 389 
Cys Cys Gly Ala Ile Ser Ala His Cys Xaa Leu Arg Leu Pro Gly Ser 
10 15 20 
agc rat dyt cct gcc tca acc tcc cga gta gvy ggg att aca ggc atg 437 
Ser Xaa Xaa Pro Ala Ser Thr Ser Arg Val Xaa Gly Ile Thr Gly Met 
25 30 35 
cgc 440 
Arg 



<210> 206 
<211> 283 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 162 . .281 

<221> sig_peptide 

<222> 162. .275 

<223> Von Heijne matrix 

score 6.90000009536743 
seq CMLFVSFLLLLLG/SR 



<400> 206 

aataactccc tttagcattt cttgtaggac aggtctgatg ttgatgaaat ctctcatctt 60 
gtttgtcaga gaaagtcttt atttctcctt catgcttgaa ggatgtttcc accggatata 120 
ctatcctagg gtaaaagttt ttttccttca gcactttaaa t atg tca tgc cac tct 176 

Met Ser Cys His Ser 

-35 

ctt ctg gcc tgt aag gtt ttc act gaa aag tct cct acc aaa cat att 224 
Leu Leu Ala Cys Lys Val Phe Thr Glu Lys Ser Pro Thr Lys His Ile 

-30 -25 -20 

aga gag cac cat tgt atg tta ttt gtt tct ttt ctc ttg ctg ctt tta 272 
Arg Glu His His Cys Met Leu Phe Val Ser Phe Leu Leu Leu Leu Leu 

-15 -10 -5 

gga tcc cgg gg 283 
Gly Ser Arg 
1 



<210> 207 
<211> 264 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 113. .262 



<221> sig_peptide 

<222> 113 . .190 

<223> Von Heijne matrix 

score 6.90000009536743 
seq LLSMLSCCQGACC/PS 



<400> 207 

gacggcggag agcagagagg gagcgcgcct tggctcgctg gccttggcgg cggctcctca 60 
ggagagctgg ggcgcccacg agaggatccc tcacccgggt ctctcctcag gg atg aca 118 

Met Thr 
-25 

tca tcc gtc cac ctc ctt gtc ttc aag gac cac ctc ctc tcc atg ctg 166 
Ser Ser Val His Leu Leu Val Phe Lys Asp His Leu Leu Ser Met Leu 

-20 -15 -10 

agc tgc tgc caa ggg gcc tgc tgc cca tct aca cct cac gag ggc act 214 
Ser Cys Cys Gln Gly Ala Cys Cys Pro Ser Thr Pro His Glu Gly Thr 

-5 1 5 

agg agc acg gtt tcc tgg atc cca cca aca tac aaa gca gcc aca cag 262 



wo 99/53051 PCT/IB99/00712 
Arg Ser Thr Val Ser Trp Ile Pro Pro Thr Tyr Lys Ala Ala Thr Gln 
10 15 20 

gg 264 



<210> 208 

<211> 422 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 352. .420 



<221> sig_peptide 

<222> 352. .408 

<223> Von Heijne matrix 

score 6.80000019073486 
seq LLSMFCVSHTVQT/AT 

<221> misc_feature 
<222> 289. .290 
<223> n=a, g, c or t 



<400> 208 

aaaataaaag tcttcttgat ttccagtgta ttcctcctgc acwttttggc ctgtttggac 60 
cacagatttg tggcttttta tgaaatacac ctgtagatta atttwcagtt thtwhayggw 120 
agtagacagt caaaggctag atcactgtra tgagtagggc ttccacattt aagaaaaagc 180 
tgtaatgaag tgaattgaat cttgcttctt ttgggtcacc caaaagcagt gataagtgct 240 
gagtgtgtta ggcacttatt aacaaaagta actcagaatt gctgtctann cctccatatc 300 
ttttttcttc tctccgtgta gttctaaaaa tgaccatatg atattccttg a atg gta 357 
Met Val 
aga gcg tct att ctt ctt agc atg ttc tgt gtg tca cac act gtg cag 405 
Arg Ala Ser Ile Leu Leu Ser Met Phe Cys Val Ser His Thr Val Gln 
-15 -10 -5 
aca gca aca tac aca ca 422 
Thr Ala Thr Tyr Thr 
1 



<210> 209 
<211> 195 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 39. .194 



<221> sig_peptide 

<222> 39. .89 

<223> Von Heijne matrix 

score 6.80000019073486 
seq ALSSFTWWAPACC/AP 



<400> 209 

agccactgca cctgggctca cagtttaaat cttgagta atg gag aaa aca gcc ttg 56 

Met Glu Lys Thr Ala Leu 
-15 

tca tcc ttt acg tgg tgg gca cct gcc tgc tgt gct cca cgt aca tac 104 
Ser Ser Phe Thr Trp Trp Ala Pro Ala Cys Cys Ala Pro Arg Thr Tyr 

-10 -5 1 5 

gtg gtg tct gca aca act ctg tca gct gtg caa ggt cac tgt cct cta 152 
Val Val Ser Ala Thr Thr Leu Ser Ala Val Gln Gly His Cys Pro Leu 
10 15 20 

cag agt aga aca tcg acc aaa gga aag tta tgg ccg ttt ggg g 195 
Gln Ser Arg Thr Ser Thr Lys Gly Lys Leu Trp Pro Phe Gly 
25 30 35 

<210> 210 
<211> 363 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 212. .361 

<221> sig_peptide 

<222> 212 . .280 

<223> Von Heijne matrix 

score 6.80000019073486 

seq KLLLSGLTQECLG/AL 

<400> 210 

taattttcat catctaaact gaatgcaaac agcgcttggc aattaaaatg aagctctcca 60 
atgaagtata cttcatcagc tgctgtcaag tcatccattg atactgtttt gcggttttta 120 
aattcctttt gtcactgtga ctgctcatca gcaggcaagg aagagcaggc aacaaaagtt 180 
gaaaagtgca tgaaggaaaa ctttgaggaa t atg ata ttc aca ttc cag caa 232 

Met Ile Phe Thr Phe Gln Gln 
-20 

att ggg gga aaa ctg cta tta tct ggt tta aca cag gag tgc ctt ggt 280 
Ile Gly Gly Lys Leu Leu Leu Ser Gly Leu Thr Gln Glu Cys Leu Gly 

-15 -10 -5 

gcc ctg cct gag gct aat gtg ttc tgt agg ggt ggc tgc aca gcc aca 328 
Ala Leu Pro Glu Ala Asn Val Phe Cys Arg Gly Gly Cys Thr Ala Thr 
1 5 10 15 

gtc ctg aaa cat ggg aaa gca tct cct gag tcc ag 363 
Val Leu Lys His Gly Lys Ala Ser Pro Glu Ser 
20 25 

<210> 211 
<211> 368 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 230 . .367 

<221> sig_peptide 

<222> 230. .322 

<223> Von Heijne matrix 

score 6.80000019073486 

seq LLALSPDLQAARG/LM 

<400> 211 

acagagaacc ctgcttcaaa gcagaagtag cagttccgga gtccagctgg ctaaaactca 60 
tccchyggat aatggcaacc catgccttag aaatcgctgg gctgtttctt ggtggtgttg 120 
gmatgggtgg gsacmrgkwg ggbkgyvack gtcatgcctc agtggdrrag tgtcggcctt 180 
cattgaaaac aacatcgtgg tttttgaaaa cttctgggaa ggactgttg atg aat tgc 238 

Met Asn Cys 
-30 

gtg agg cag gct aac atc agg atg cag tgc aaa atc tat gat tcc ctg 286 
Val Arg Gln Ala Asn Ile Arg Met Gln Cys Lys Ile Tyr Asp Ser Leu 

-25 -20 -15 

ctg gct ctt tct ccg gac cta cag gca gcc aga ggr ctg atg tgt gct 334 
Leu Ala Leu Ser Pro Asp Leu Gln Ala Ala Arg Gly Leu Met Cys Ala 

-10 -5 1 

gct tcc gtg atg tcc ttc ttg gct ttc atg atg g 368 
Ala Ser Val Met Ser Phe Leu Ala Phe Met Met 
5 10 15 



<210> 212 
<211> 448 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 316. .447 



<221> sig_peptide 

<222> 316. .435 

<223> Von Heijne matrix 

score 6.80000019073486 
seq LLKLLISLRSFWA/ET 



<400> 212 

ttgtcttggc tatacgggat cttttttggt cccatatgaa atttaagtag cttttcctaa 60 
ttctgtgaag gaagtcaatg gtagcttgat gggaatagca ttgaatctat aaattacttt 120 
gggccgtatg gcatttgggc aatattgatt cttcctattc atgagcatgg aatgtttttc 180 
catttgttca tgtcctctct tattttgttg agcagtggtt tgtagttctc cttgaagggg 240 
ttcttcacat cccttgtaag ttgtattccc aggtatttta ttctctttgt agcaattttg 300 
aatgggagtt cactc atg att tgg ctc tct ttt tgt cta tta ttg gtg tat 351 
Met Ile Trp Leu Ser Phe Cys Leu Leu Leu Val Tyr 
-40 -35 -30 

agg aat gct tgt gat ttt tgc aca ttg act tta tat cct ggg act ttg 399 
Arg Asn Ala Cys Asp Phe Cys Thr Leu Thr Leu Tyr Pro Gly Thr Leu 

-25 -20 -15 

ctg aag ttg ctt atc agc tta agg agt ttt tgg gct gag acg acg ggg g 448 
Leu Lys Leu Leu Ile Ser Leu Arg Ser Phe Trp Ala Glu Thr Thr Gly 
-10 -5 1 



<210> 213 
<211> 158 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 28. .156 



<221> sig_peptide 

<222> 28. .102 

<223> Von Heijne matrix 

score 6.69999980926514 
seq LVGSLHLFLSVLA/SK 



<400> 213 

gcgctgggag ttctcttttt cacttga atg ttt tct tct cca ggg ctg agg acg 54 

Met Phe Ser Ser Pro Gly Leu Arg Thr 
-25 -20 
ctc ttt gta ttg gta ggc agc ctg cac ttg ttc ctt tca gtc ctg gca 102 
Leu Phe Val Leu Val Gly Ser Leu His Leu Phe Leu Ser Val Leu Ala 

-15 -10 -5 

agt aaa agc agg aat tct aaa aag caa cga tta ttc ctc cta gtt cct 150 
Ser Lys Ser Arg Asn Ser Lys Lys Gln Arg Leu Phe Leu Leu Val Pro 
1 5 10 15 

ttg tac ag 158 
Leu Tyr 

<210> 214 
<211> 193 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .191 

<221> sig_peptide 
<222> 39. .107 
<223> Von Heijne matrix 

score 6.69999980926514 

seq NVCSLPAPGLCSG/QP 

<400> 214 

aagaaaagct ttgggtcaac tcagcatcat gtttgcag atg ctg aca gac ggg atc 56 
Met Leu Thr Asp Gly Ile 
-20 
cta atg aga gtc aat gtg tgc tca ctg cca gct cct ggg ctg tgc tct 104 
Leu Met Arg Val Asn Val Cys Ser Leu Pro Ala Pro Gly Leu Cys Ser 
-15 -10 -5 
ggt cag cca ggt gtg agg gcc tgg cct ggg gtc aca cag ctg act car 152 
Gly Gln Pro Gly Val Arg Ala Trp Pro Gly Val Thr Gln Leu Thr Gln 
1 5 10 15 
bta gag gaa tgc cca tgg ttc tca gca ttg gaa gga ctg gg 193 
Xaa Glu Glu Cys Pro Trp Phe Ser Ala Leu Glu Gly Leu 
20 25 

<210> 215 
<211> 214 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 67. .213 

<221> sig_peptide 
<222> 67. .165 
<223> Von Heijne matrix 
score 6.69999980926514 
seq ILLLSLIFGPCIL/NS 

<400> 215 

aaagttcgag aaaatchaga taggcaccaa caagaacgag aaaataacat cccctggtat 60 
caaagc atg ttt aac tgg aac cca tgg cta act act tta atc act ggg 108 
Met Phe Asn Trp Asn Pro Trp Leu Thr Thr Leu Ile Thr Gly 
-30 -25 -20 
wta gch gga cct ctc ctc atc cta cta tta agt tta att ttt ggg cct 156 
Xaa Ala Gly Pro Leu Leu Ile Leu Leu Leu Ser Leu Ile Phe Gly Pro 
-15 -10 -5 
tgt ata tta aat tcg ttt ctk aat tkt ata aaa caa cgc ata gct tct 204 
Cys Ile Leu Asn Ser Phe Leu Asn Xaa Ile Lys Gln Arg Ile Ala Ser 
1 5 10 
ggc aaa cgg g 214 
Gly Lys Arg 
15 



<210> 216 
<211> 327 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 
<222> 22. .327 



<221> sig_peptide 

<222> 22. .108 

<223> Von Heijne matrix 

score 6.69999980926514 
seq FCALLLSLXXXXP/XX 



<400> 216 

ctccgcgttc cagaatccaa g atg gcg gga tcc agg caa ^gg ggt ctc cgg 51 
Met Ala Gly Ser Arg Gln Arg Gly Leu Arg 
-25 -20 
gcc aga gtt cgg ccg ctg ttc tgc gcc ttg ctg ctg tca ctm sgw hsv 99 
Ala Arg Val Arg Pro Leu Phe Cys Ala Leu Leu Leu Ser Leu Xaa Xaa 
-15 -10 -5 
mty ckt ccg rkg cka cgs cgt gkg agg aga ccc cgc ggt cgc gtt gcc 147 
Xaa Xaa Pro Xaa Xaa Arg Arg Xaa Arg Arg Pro Arg Gly Arg Val Ala 
1 5 10 
aca tcg ccg ttt cga gta saa ata cag ctt caa ggg gcc gca cct ggt 195 
Thr Ser Pro Phe Arg Val Xaa Ile Gln Leu Gln Gly Ala Ala Pro Gly 
15 20 25 
gca gag cga cgg gac cgt gcc ctt ctg ggm cca cgc ggg gaa tgc tat 243 
Ala Glu Arg Arg Asp Arg Ala Leu Leu Gly Pro Arg Gly Glu Cys Tyr 
30 35 40 45 
tcc aag ttc aga tca aat tcg agt agc acc atc ttt aaa aag cya aag 291 
Ser Lys Phe Arg Ser Asn Ser Ser Ser Thr Ile Phe Lys Lys Xaa Lys 
50 55 60 
agg ctc agt gtg gvm aam gac aav agc gga cct ggg 327 
Arg Leu Ser Val Xaa Xaa Asp Xaa Ser Gly Pro Gly 
65 70 



<210> 217 
<211> 357 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 70. .357 



<221> sig_peptide 

<222> 70. .126 

<223> Von Heijne matrix 

score 6.69999980926514 

seq WVFLVAILKGVHC/DV 

<400> 217 

aggagcccca gccctgggat tcccagctgt ttctgcttgc tgatcaggac tgcacacaga 60 
gaactcacc atg gag ttt ggg ctg agc tgg gtt ttc ctt gtt gct att tta 111 
Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Ile Leu 
-15 -10 
aaa ggt gtc cac tgt gac gtg cag ctg gtg gag tcc ggg gga ggt tta 159 
Lys Gly Val His Cys Asp Val Gln Leu Val Glu Ser Gly Gly Gly Leu 
-5 1 5 10 

gtt cag cct ggg ggg tcc ctg aga ctc tcc tgt gca gcc tct gga ctc 207 
Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Leu 

15 20 25 

acc ctc agt aac gac tgg atg cac tgg gtc cgc caa gcc cca ggg aag 255 
Thr Leu Ser Asn Asp Trp Met His Trp Val Arg Gln Ala Pro Gly Lys 

30 35 40 

ggg ctg gtg tgg gtc tca cac att gat agt tct rgg act atc aca aat 303 
Gly Leu Val Trp Val Ser His Ile Asp Ser Ser Xaa Thr Ile Thr Asn 

45 50 55 

tac gcg gac tcc gtg aag ggc cga ttc acc atc tcc aga gac aac gcc 351 
Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala 
60 65 70 75 

aag tgg 357 
Lys Trp 

<210> 218 
<211> 189 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 74. .187 



<221> sig_peptide 

<222> 74. .118 

<223> Von Heijne matrix 

score 6.69999980926514 
seq LFLGFLACSVAYQ/CH 



<400> 218 

ttatcaagga cactgtcttt tcgccatcat gtgttcttgg cccctctgtt gaaattcaat 60 
ctatcataga caa atg ggt tta ttt ctg ggc ttt cta gcc tgt tct gtt 109 
Met Gly Leu Phe Leu Gly Phe Leu Ala Cys Ser Val 
-15 -10 -5 

gca tac cag tgc cat tct gct ttt gtt act gta gct tca cag tac act 157 
Ala Tyr Gln Cys His Ser Ala Phe Val Thr Val Ala Ser Gln Tyr Thr 

1 5 10 
ttg aaa tca gag act ttg atg ccc gca gcg gg 189 
Leu Lys Ser Glu Thr Leu Met Pro Ala Ala 
15 20 



<210> 219 
<211> 353 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 41. .352 



<221> sig_peptide 

<222> 41 . . 187 

<223> Von Heijne matrix 

score 6.69999980926514 

seq FLGLIFFLELATG/IL 

<400> 219 

agttgacttg ccatctgcct tgcaggatgg catccagccc atg tgg gag gac agc 55 
Met Trp Glu Asp Ser 
-45 
agg aat aaa cgg ggt ggc cgc tgg ctg gtc agc ctg gcc aag cag cag 103 
Arg Asn Lys Arg Gly Gly Arg Trp Leu Val Ser Leu Ala Lys Gln Gln 
-40 -35 -30 
cgc cac att gag ctg gac cgg ctg tgg ctg gag acg ttc tcc gtg ttc 151 
Arg His Ile Glu Leu Asp Arg Leu Trp Leu Glu Thr Phe Ser Val Phe 
-25 -20 -15 
ctc ... ctc atc ttc ttc ctg gag ctg gca aca ggg atc ctg gcc ttt 199 
Leu Gly Leu Ile Phe Phe Leu Glu Leu Ala Thr Gly Ile Leu Ala Phe 
-10 -5 1 
gtc ttc aag gac tgg att cga gac cag ctc aac ctc ttc atc aac aac 247 
Val Phe Lys Asp Trp Ile Arg Asp Gln Leu Asn Leu Phe Ile Asn Asn 
5 10 15 20 
aac gtc aag gcc tac cgg gac gac att gac ctc cag arc ctc att gac 295 
Asn Val Lys Ala Tyr Arg Asp Asp Ile Asp Leu Gln Xaa Leu Ile Asp 
25 30 35 
ttt gct cag gaa tac tgg tct tgc tgc gga scc gag gcc cca ata rdt 343 
Phe Ala Gln Glu Tyr Trp Ser Cys Cys Gly Xaa Glu Ala Pro Ile Xaa 
40 45 50 
gga acc ggg g 353 
Gly Thr Gly 
55 



<210> 220 
<211> 115 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 12.. 113 

<221> sig_peptide 

<222> 12. .53 

<223> Von Heijne matrix 

score 6.59999990463257 

seq FLSLSTAFWWYA/MI 

<400> 220 

actagcattt c atg ttt tta tct ctc tct act gca ttc tgg gta gtt tat 50 
Met Phe Leu Ser Leu Ser Thr Ala Phe Trp Val Val Tyr 
-10 -5 
gcc atg ata att tat tca gct ctc tct gct gga ttt att att ttc ttt 98 
Ala Met Ile Ile Tyr Ser Ala Leu Ser Ala Gly Phe Ile Ile Phe Phe 

1 5 10 15 

tta gtt gtg ttt aat ct 115 
Leu Val Val Phe Asn 
20 

<210> 221 
<211> 142 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .142 

<221> sig_peptide 

<222> 29. .130 

<223> Von Heijne matrix 

score 6.59999990463257 

seq FFLFFCFETGSHS/VT 

<400> 221 

cctgcccatt gcttcaacct gcacctct atg tac att gtg atg gat cta cct 52 
Met Tyr Ile Val Met Asp Leu Pro 
-30 
cta tgg ctc tcc cat gag gtc caa tct tat att cct tct ttc ttc ctt 100 
Leu Trp Leu Ser His Glu Val Gln Ser Tyr Ile Pro Ser Phe Phe Leu 
-25 -20 -15 
ttt ttt tgc ttt gag act ggg tct cac tct gtc acc cac ggg 142 
Phe Phe Cys Phe Glu Thr Gly Ser His Ser Val Thr His Gly 
-10 -5 1 

<210> 222 
<211> 370 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 209. .370 



<221> sig_peptide 

<222> 209. .289 

<223> Von Heijne matrix 

score 6.59999990463257 
seq LAFSFSFFPSSFS/SF 



<400> 222 

ttttggatac atatgactca tgtgacatta gatcacagca tttttgtttt tattattaat 60 
atattgcctt agaactacat tgctaaacct ggtctttgta tctgcgaagt tctaacatct 120 
tgccacagct tagttagctt tgagagggaa agggtagaat ccatttaagg agacaggtta 180 
aaaaatgata tatttaagca tataggca atg gta gca cat gat tac caa aac 232 

Met Val Ala His Asp Tyr Gln Asn 
-25 -20 
ata att agc ctt ttc ttt ctt gct ttt tca ttt tct ttc ttt cct tct 280 
Ile Ile Ser Leu Phe Phe Leu Ala Phe Ser Phe Ser Phe Phe Pro Ser 

-15 -10 -5 

tca ttt tct tct ttc ttt ctt ktc ttt ctt tct ttt ttc tct tct ttc 328 
Ser Phe Ser Ser Phe Phe Leu Xaa Phe Leu Ser Phe Phe Ser Ser Phe 

1 5 10 
ttt ctc tct ctt ctt tct ttc cct tcc ttc ctc ccc ccc ggr 370 
Phe Leu Ser Leu Leu Ser Phe Pro Ser Phe Leu Pro Pro Gly 
15 20 25 



<210> 223 
<211> 431 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22. .429 



<221> sig_peptide 

<222> 22. .66 

<223> Von Heijne matrix 

score 6.59999990463257 
seq ALRALCGFRGVM/QV 



<400> 223 



gcagtctgca gccggagtaa g atg gcg gcg ctg agg gct ttg tgc ggc ttc 51 
Met Ala Ala Leu Arg Ala Leu Cys Gly Phe 
-15 -10 
cgg ggc gtc gcg gcc cag gtg ctg cgg mct ggg gct gga gtc cga ttg 99 
Arg Gly Val Ala Ala Gln Val Leu Arg Xaa Gly Ala Gly Val Arg Leu 
-5 1 5 10 
ccg att cag ccc agc aga ggt gtt cgg cag tgg cag cca gat gtg gaa 147 
Pro Ile Gln Pro Ser Arg Gly Val Arg Gln Trp Gln Pro Asp Val Glu 
15 20 25 
tgg gca cag cag ttt ggg gga gct gtt atg tac cca agc aaa gaa aca 195 
Trp Ala Gln Gln Phe Gly Gly Ala Val Met Tyr Pro Ser Lys Glu Thr 
30 35 40 
gcc cac tgg aag cct cca cct tgg aat gat gtg gac cct cca aag gac 243 
Ala His Trp Lys Pro Pro Pro Trp Asn Asp Val Asp Pro Pro Lys Asp 
45 50 55 
aca att gtg aag aac att acc ctg aac ttt ggg ccc caa cac cca gca 291 
Thr Ile Val Lys Asn Ile Thr Leu Asn Phe Gly Pro Gln His Pro Ala 
60 65 70 75 
gcg cat ggt gtc ctg cga cta gtg atg gaa ttg agt ggg gag atg gtg 339 
Ala His Gly Val Leu Arg Leu Val Met Glu Leu Ser Gly Glu Met Val 
80 85 90 
cgg aag tgt gat cct cac atc ggg ctc ctg cac cga ggc act gag aag 387 
Arg Lys Cys Asp Pro His Ile Gly Leu Leu His Arg Gly Thr Glu Lys 
95 100 105 
ctc att gaa tac aag rcc tat ctt cag gcc ctt cca tac ttt ga 431 
Leu Ile Glu Tyr Lys Xaa Tyr Leu Gln Ala Leu Pro Tyr Phe 
110 115 120 



<210> 224 
<211> 282 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 132 . .281 



<221> sig_peptide 
<222> 132. .215 
<223> Von Heijne matrix 
score 6.5 

seq LVFLHLFLXVYLG/LV 



<400> 224 

attttaaagt gtttctgtta atgtattcta cttcagtccc ccaaaattcc aactaacgac 60 
atacatgaat aacagatcat gactgctgtt tctacaagcc tttctgctca ctgtgcttcc 120 
acttacaact c atg tta ata tgg tct tcc tct tct ttt cct gca ccc cct 170 
Met Leu Ile Trp Ser Ser Ser Ser Phe Pro Ala Pro Pro 
-25 -20 
ctc ttt ctt gtc ttt ctt cat ctt ttc ctt mwt gtc tat ttg ggt ctt 218 
Leu Phe Leu Val Phe Leu His Leu Phe Leu Xaa Val Tyr Leu Gly Leu 
-15 -10 -5 1 

gtc atg ccc act caa cag tat ctc ctc ctg cag agt cca ttg atg ttc 266 
Val Met Pro Thr Gln Gln Tyr Leu Leu Leu Gln Ser Pro Leu Met Phe 

5 10 15 

aca gac aaa gcc cag c 282 
Thr Asp Lys Ala Gin 
20 



<210> 225 
<211> 198 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 26. .196 



<221> sig_peptide 
<222> 26. .163 
<223> Von Heijne matrix 
score 6.5 

seq WLFFLMLSLCTPP/DR 






<400> 225 

acttttgctt tatgttcagg ggtcc atg tgt agg atg tgc agg ttt gtt aca 52 
Met Cys Arg Met Cys Arg Phe Val Thr 
-45 -40 
tgg ata aac gtg tgc cat ggt gat ttg ctg cac aga tca tcc cgt cgc 100 
Trp Ile Asn Val Cys His Gly Asp Leu Leu His Arg Ser Ser Arg Arg 
-35 -30 -25 
ctg ggt gtg aag ccg agc acg cat tgg cta ttc ttc ctg atg ctc tcc 148 
Leu Gly Val Lys Pro Ser Thr His Trp Leu Phe Phe Leu Met Leu Ser 
-20 -15 -10 
ctt tgc acc cct cct gac aga ccc tgg tgt gtg ttg ttc ccc ccg ctg 196 
Leu Cys Thr Pro Pro Asp Arg Pro Trp Cys Val Leu Phe Pro Pro Leu 
-5 1 5 10 
gg 198 



<210> 226 
<211> 141 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 21. .140 



<221> sig_peptide 
<222> 21. .113 
<223> Von Heijne matrix 
score 6.5 

seq LSLSLPLSLXLLX/XP 



<400> 226 

gcagttgakr dsacttggta atg tsa acg caa gaa gca ggc ttg aty ttt ttt 53 

Met Xaa Thr Gln Glu Ala Gly Leu Ile Phe Phe 
-30 -25 
tct ccc ccc ttc tct ctc tct ctc tct ctc tct cty cct ctc tcc ctc 101 
Ser Pro Pro Phe Ser Leu Ser Leu Ser Leu Ser Leu Pro Leu Ser Leu 
-20 -15 -10 -5 

tyt ctc ctc tst sac cca cac tca cgc aca cct caa agg g 141 
Xaa Leu Leu Xaa Xaa Pro His Ser Arg Thr Pro Gln Arg 
1 5 



<210> 227 
<211> 206 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 77. .205 



<221> sig_peptide 
<222> 77. .115 
<223> Von Heijne matrix 
score 6.5 

seq MEFLLLWSLXSNG/KR 



<400> 227 

awcaartact tgttaagcrt ttacgatgtg ccaggttctg tgctaggtgc tgagtgtaca 60 
ttgttgarcc aaacag atg gag ttc ctg cta ttg tgg agt ttg cmg tct aat 112 
Met Glu Phe Leu Leu Leu Trp Ser Leu Xaa Ser Asn 
-10 -5 
ggg aag aga ggc cag gca tgg cgg ctc atg cct gtw gtc cca gca gtt 160 
Gly Lys Arg Gly Gln Ala Trp Arg Leu Met Pro Val Val Pro Ala Val 

1 5 10 15 

tgg gag cct gag gca ggt gga ttg ctt cag ctc ggg ggt tct agg g 206 
Trp Glu Pro Glu Ala Gly Gly Leu Leu Gln Leu Gly Gly Ser Arg 
20 25 30 

<210> 228 
<211> 480 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 216.. 479 

<221> sig_peptide 
<222> 216. .326 
<223> Von Heijne matrix 
score 6.5 

seq LLVFFLIVRTLSC/RS 

<400> 228 

gcatccccck ktagctcaga gaagtttgtt rdgaccgatc ttctgaagcc tacttctgtc 60 
aactcatcaa agtcattctc catccagctt tgttccatta tgggtgagga gctacgatcc 120 
tttggaggag aagaggcact ctgattttta gaattttcag cttttctgct ctggtttcgc 180 
cccatctttg tggttttatc taccttcggt ctttg atg atg gtg acc tac aga 233 

Met Met Val Thr Tyr Arg 
-35 

tgg ggt ttt ggt gtg gat gtc mtt ttt gtt gct gtt gat gct att cct 281 
Trp Gly Phe Gly Val Asp Val Xaa Phe Val Ala Val Asp Ala Ile Pro 

-30 -25 -20 

ttc tgt ttg tta gtt ttc ttt cta ata gtc agg acc ctc agc tgc agg 329 
Phe Cys Leu Leu Val Phe Phe Leu Ile Val Arg Thr Leu Ser Cys Arg 
-15 -10 -5 1 

tct gtt gga gta tgc tgg agg tcc act cca gac cct gtt tgc cta ggt 377 
Ser Val Gly Val Cys Trp Arg Ser Thr Pro Asp Pro Val Cys Leu Gly 

5 10 15 

atc acc agc aga ggc tgc aga aca gaa ata ttg cag aac agc aaa tgt 425 
Ile Thr Ser Arg Gly Cys Arg Thr Glu Ile Leu Gln Asn Ser Lys Cys 

20 25 30 

tgc tcc ctg atc ctt cct ctg gaa gct tcg tct caa agg ggc act gaa 473 
Cys Ser Leu Ile Leu Pro Leu Glu Ala Ser Ser Gln Arg Gly Thr Glu 

35 40 45 

tgt atg a 480 
Cys Met 
50 

<210> 229 
<211> 144 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 43. .144 

<221> sig_peptide 
<222> 43 . .99 
<223> Von Heijne matrix 
score 6.5 

seq EIFLPFSLSPANA/QS 



<400> 229 
tccagatgtg atttggtatt tcatactttg ttgcttttgt aa atg ctg tac cca 54 

Met Leu Tyr Pro 

ctg cct gag ata ttc tta cct ttt tct ttg tcc cca gca aat gcc cag 102 
Leu Pro Glu Ile Phe Leu Pro Phe Ser Leu Ser Pro Ala Asn Ala Gln 
-15 -10 -5 1 

tca aaa ttt agc ctt tat ttt ttt ccc ttg gtg aag ccg ggg 144 
Ser Lys Phe Ser Leu Tyr Phe Phe Pro Leu Val Lys Pro Gly 
5 10 15 



<210> 230 
<211> 457 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 314 . .457 



<221> sig_peptide 

<222> 314. .394 

<223> Von Heijne matrix 

score 6.40000009536743 
seq RLLCLXFXRLLLG/TS 

<221> misc_feature 

<222> 118,258. .259,303,440 

<223> n=a, g, c or t 



<400> 230 

agctccgcgg taatggaggc tagggatggg tgctgaagta tcaggctctg gctctagctt 60 
tagctctggc actggaactg cgtcggagtc tgggtctgag tctggcagcc cgaagccntg 120 
grmcaccttt tcttgattct ctaaggcggg ggctgcctgc gtccaagcag ctggtttgca 180 
gcgttccaac gctgggaggg agttccctta cctggggtcc agtctgtaaa gttgtcgccg 240 
ctttctaggg acccgccnnd scggctggga ctcttccatg cgtgagtatt actgarstgc 300 
tsnaaggtcc ggc atg tcc ctg gaa cct gcc tcg gsc ctc ttg ggt gtg 349 
Met Ser Leu Glu Pro Ala Ser Xaa Leu Leu Gly Val 
-25 -20 
cgg cgg aga ctg ctt tgt cta mct ttc tsc cga ctt ctc tta ggr acc 397 
Arg Arg Arg Leu Leu Cys Leu Xaa Phe Xaa Arg Leu Leu Leu Gly Thr 
-15 -10 -5 1 

agt ctg ttg aag ttt gtg gkc tcc tgs agy cca ccc ama ccg nat act 445 
Ser Leu Leu Lys Phe Val Xaa Ser Xaa Ser Pro Pro Xaa Pro Xaa Thr 

5 10 15 

ctc acc tct tcc 457 
Leu Thr Ser Ser 
20 



<210> 231 
<211> 112 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 12. .110 



<221> sig_peptide 

<222> 12. .83 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LSVLILCVCVCVC/VY 



<400> 231 

ctgattttkc t atg ytg att ttg tat ctk gca act tta cta aat tta tca 50 
Met Leu Ile Leu Tyr Leu Ala Thr Leu Leu Asn Leu Ser 
-20 -15 
gtt cta ata ctt tgt gtg tgt gtg tgt gtg tgt gtg tat gat tta tat 98 
Val Leu Ile Leu Cys Val Cys Val Cys Val Cys Val Tyr Asp Leu Tyr 

-10 -5 1 5 

ata waa agg gga gt 112 
Ile Xaa Arg Gly 

<210> 232 
<211> 359 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8. .358 

<221> sig_peptide 
<222> 8. .55 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LGTTVLLWSLLRS/SP 

<221> misc_feature 
<222> 326 

<223> n=a, g, c or t 



<400> 232 



gataatc atg gcg ccc ctc gga aca act gta ttg ctg tgg agc ctc ttg 49 
Met Ala Pro Leu Gly Thr Thr Val Leu Leu Trp Ser Leu Leu 
-15 -10 -5 
agg agt tct ccg ggc gtg gaa cgg gtc tgt ttc cgg gct cga atc cag 97 
Arg Ser Ser Pro Gly Val Glu Arg Val Cys Phe Arg Ala Arg Ile Gln 
1 5 10 
ccc tgg cac ggt ggc ctg ctc caa ccg cta cct tgc tct ttc gag atg 145 
Pro Trp His Gly Gly Leu Leu Gln Pro Leu Pro Cys Ser Phe Glu Met 
15 20 25 30 
ggg ctg cca cgc cgc cgg ttc agc tcc gag gcc gca gaa tct ggt agc 193 
Gly Leu Pro Arg Arg Arg Phe Ser Ser Glu Ala Ala Glu Ser Gly Ser 
35 40 45 
cca gag acc aag aaa cct aca ttt atg gat gag gaa gtt caa agc ata 241 
Pro Glu Thr Lys Lys Pro Thr Phe Met Asp Glu Glu Val Gln Ser Ile 
50 55 60 
ctc acg aaa atg aca ggc ttg aac ttg cag aag act ttt aag cca gct 289 
Leu Thr Lys Met Thr Gly Leu Asn Leu Gln Lys Thr Phe Lys Pro Ala 
65 70 75 
ata caa gaa ctg aag cca cca acc tat aag cta atg nct cag gca cag 337 
Ile Gln Glu Leu Lys Pro Pro Thr Tyr Lys Leu Met Xaa Gln Ala Gln 
80 85 90 
ttg gaa gag gct aca aga cag g 359 
Leu Glu Glu Ala Thr Arg Gln 
95 100 

























<210> 233 
<211> 301 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 




<222> 4 . .300 



<221> sig_peptide 

<222> 4. .105 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LLFWLLPPPPGS/VX 

<221> misc_feature 
<222> 124,129,162 
<223> n=a, g, c or t 



<400> 233 

gcg atg ctc ctc act ttc agc tcc agc tcc cgc cac cgc cgc ctc tat 48 
Met Leu Leu Thr Phe Ser Ser Ser Ser Arg His Arg Arg Leu Tyr 
-30 -25 -20 

cgc cgc cgc cgc cac cac ctc ctc ttc gtt gtc ctc ctt cct cct ccg 96 
Arg Arg Arg Arg His His Leu Leu Phe Val Val Leu Leu Pro Pro Pro 

-15 -10 -5 

cct ggc agc gtt gkt ctc tgc agc sgg nrm grn smv raa gtg ctr vbg 144 
Pro Gly Ser Val Xaa Leu Cys Ser Xaa Xaa Xaa Xaa Xaa Val Leu Xaa 

1 5 10 
kma sga aag ttc cgg gan gga cta cat gga gcc atg ctc cct ggg ctc 192 
Xaa Xaa Lys Phe Arg Xaa Gly Leu His Gly Ala Met Leu Pro Gly Leu 

15 20 25 

ttc cgc ggg cgc ccg cgc gct gcc ctt cgc ttg aga gtc tca ccg wgt 240 
Phe Arg Gly Arg Pro Arg Ala Ala Leu Arg Leu Arg Val Ser Pro Xaa 
30 35 40 45 

tgc cca ggc tgg aaa gtg gcg cga tct cgg ctc aca gca acc tcc gcc 288 
Cys Pro Gly Trp Lys Val Ala Arg Ser Arg Leu Thr Ala Thr Ser Ala 

50 55 60 

tcm cgg gmc cgg g 301 
Ser Arg Xaa Arg 
65 



<210> 234 
<211> 248 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 152. .247 



<221> sig_peptide

<222> 152. .190 

<223> Von Heijne matrix 

score 6.40000009536743 
seq MLLLLQLNLKTLS/SS 



<400> 234 

acaagttggg tgctgtcgcc tgcgcatcgg cgggccggga ggctgagcag tactgttgag 60
agcggtgtga ggtgcttggt agcgcgccgt agctgcttcc acgtccttgc ttcacctcag 120
gtaaagagag aagtaatgga aggcctgtct g atg ttg ctt ctt ttg caa cta 172
Met Leu Leu Leu Leu Gln Leu
-10

aac tta aaa aca ctc tca tcc agt acc ata gca ttg aag aag ata agt 220
Asn Leu Lys Thr Leu Ser Ser Ser Thr Ile Ala Leu Lys Lys Ile Ser
-5 1 5 10

ggc gag ttg cta aga aaa cga aag agg g 248
Gly Glu Leu Leu Arg Lys Arg Lys Arg
15



<210> 235
<211> 393 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 339. .392 



<221> sig_peptide 

<222> 339. .383 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LFVLLIITQLLYG/GI 



<400> 235 

gttccaaagt gagctgtctc tggcagcatt catatagaat agaatttgaa tggtgcaccc 60 
agatttgaac aacatggtaa tcatgtgatg gacatggaaa agtgractaa cbtkrgggat 120 
cwtggtargg tcaytaagaa taactckaat cawgatgtta aaaggctttc ctttacattc 180 
acaaaacaat ttrsttccta gaagtagttt attcttgcct gtggtcattt ttgctccttt 240 
ataatactac atctaaatca atttgttaaa tatagtagag aaatgaaata aatttcttcc 300 
agttaaacca ctgcacttaa agagtagaaa ccctctct atg tca ctc ttt gtt ttg 356
Met Ser Leu Phe Val Leu
-15 -10
ttg atc ata act caa ctg ctg tat ggt ggg ata ctc t 393
Leu Ile Ile Thr Gln Leu Leu Tyr Gly Gly Ile Leu
-5 1 



<210> 236 
<211> 222 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 121. .222 



<221> sig_peptide

<222> 121. .204 

<223> Von Heijne matrix 

score 6.40000009536743 
seq ILFLGVLLSASDL/CV 



<400> 236 

ttttgagtta atttttgtat aagttgtaag gattaggtca gggttcttaa gaaaaatatt 60 
gttttggtct atagatgtct cattgcttct gtgctatttg ttggaaaagc tgttcttcca 120
atg aat tgc ttt tgc aat ttt gtc aaa acc agt gag gca tat atg att 168
Met Asn Cys Phe Cys Asn Phe Val Lys Thr Ser Glu Ala Tyr Met Ile
-25 -20 -15

ctg ttt cta ggt gtt cta ctc tct gca agt gat tta tgt gtc tat ccc 216
Leu Phe Leu Gly Val Leu Leu Ser Ala Ser Asp Leu Cys Val Tyr Pro
-10 -5 1

atc ggg 222
Ile Gly
5 



<210> 237 
<211> 154 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS
<222> 54..152

<221> sig_peptide

<222> 54. .95 

<223> Von Heijne matrix 

score 6.40000009536743 
seq SVILALWEAEAGG/SP 



<400> 237 

agtcttttgc tcctgtggtt aagattattc tgctaggctg ctcacggtgg ctg atg 56 

Met 

tct gta atc cta gca ctt tgg gag gcc gag gcg ggc gga tcg cct gag 104
Ser Val Ile Leu Ala Leu Trp Glu Ala Glu Ala Gly Gly Ser Pro Glu
-10 -5 1

atc ggg agt tcg gga ccg gcc gca cca aca tgg aga agc ccc gtc cag 152
Ile Gly Ser Ser Gly Pro Ala Ala Pro Thr Trp Arg Ser Pro Val Gln
5 10 15

gg 154

<210> 238 
<211> 439 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 255. .437 



<221> sig_peptide

<222> 255..341

<223> Von Heijne matrix

score 6.30000019073486
seq LGCLLLAVRSSAT/VN

<221> misc_feature
<222> 359. .360,381 
<223> n=a, g, c or t 



<400> 238 

tcaccacaat caattttaga acattttcat catcccgaaa ataagccctg ttccctttag 60
ctgtcactcc ccactcctac cccccagccc tgtgcaataa tctactttct gtctttgaag 120
ctttgcctat tctggacatt ttgtataaaa gggtttgtgg aggatgtggt cttttgtgac 180
tggcttcttg aacttggcat agtgttttca aggttcaacc atgttgtagc acgtacgttc 240
ctttttatgg ccaa atg tac gga gag tcc aca ttg ttt atc cat tca tca 290
Met Tyr Gly Glu Ser Thr Leu Phe Ile His Ser Ser
-25 -20

gtt cat ggg cat ttg ggt tgt ctc ctc ttg gct gtt agg agt agt gct 338
Val His Gly His Leu Gly Cys Leu Leu Leu Ala Val Arg Ser Ser Ala
-15 -10 -5

act gtg aac att acg tac chn nkw gtk tgt gtg gac att cak ntt cat 386
Thr Val Asn Ile Thr Tyr Xaa Xaa Val Cys Val Asp Ile Xaa Xaa His
1 5 10 15

ttc cat atg ctt atg tct gga att act ggg tca tat ggc aac tct ctt 434
Phe His Met Leu Met Ser Gly Ile Thr Gly Ser Tyr Gly Asn Ser Leu
20 25 30

tca ct 439
Ser



<210> 239 
<211> 229 
<212> DNA
<213> Homo sapiens

<220>
<221> CDS 
<222> 7. .228 

<221> sig_peptide 

<222> 7. .159 

<223> Von Heijne matrix 

score 6.30000019073486 

seq WLYLLEVVAPLSG/IH

<400> 239 

gtcaag atg gcg gcg tct gta tta aac acc gtg ctg agg cgg ctt cct 48
Met Ala Ala Ser Val Leu Asn Thr Val Leu Arg Arg Leu Pro
-50 -45 -40

atg cta tct ctc ttc cga ggt tct cac aga gtt cag gta act ctt cga 96
Met Leu Ser Leu Phe Arg Gly Ser His Arg Val Gln Val Thr Leu Arg
-35 -30 -25

aag aca ttt tgc aca acc tca agt tgg tta tac ctt ctc gag gtt gtc 144
Lys Thr Phe Cys Thr Thr Ser Ser Trp Leu Tyr Leu Leu Glu Val Val
-20 -15 -10

gct cca ctg tca gga atc cac gag tgg aga cct tcc cac gtg tgt ctt 192
Ala Pro Leu Ser Gly Ile His Glu Trp Arg Pro Ser His Val Cys Leu
-5 1 5 10

agc tgt cta ggc agt act tcc tgc aac ccc cct gag g 229
Ser Cys Leu Gly Ser Thr Ser Cys Asn Pro Pro Glu
15 20 

<210> 240 
<211> 318 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 65. .316 

<221> sig_peptide 

<222> 65. .259 

<223> Von Heijne matrix 

score 6.30000019073486 

seq LMVVAETSQGSWS/AP

<221> misc_feature
<222> 259 

<223> n=a, g, c or t 



<400> 240 

ctcttcggtt gtccagccct tctcccagcc ctggtccctc agaaggaggg taactccctt 60
ccag atg tta cgg tcc gcc tgc gtc tct cag cac gcc ggt ggc att tgg 109
Met Leu Arg Ser Ala Cys Val Ser Gln His Ala Gly Gly Ile Trp
-65 -60 -55

gtt gac cgc gga ggc ccc cag tgc cag agg gtg ttc acg ttc tgc cgt 157
Val Asp Arg Gly Gly Pro Gln Cys Gln Arg Val Phe Thr Phe Cys Arg
-50 -45 -40 -35

ggg ctc agc cca aac ttt gga cgc tca gag acc caa cgg gag cgc tgg 205
Gly Leu Ser Pro Asn Phe Gly Arg Ser Glu Thr Gln Arg Glu Arg Trp
-30 -25 -20

ata agg cca gga cag ctg atg gtt gtg gca gaa aca tct caa ggt agc 253
Ile Arg Pro Gly Gln Leu Met Val Val Ala Glu Thr Ser Gln Gly Ser
-15 -10 -5

tgg tcn gcc ccc act tcc cca tst acc tct tgt cct ccc ccc aac acc 301
Trp Ser Ala Pro Thr Ser Pro Xaa Thr Ser Cys Pro Pro Pro Asn Thr
1 5 10
asc acc aca ccg gyt cc 318 
Xaa Thr Thr Pro Xaa 
15 

<210> 241 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 123. .404 

<221> sig_peptide

<222> 123. .257 

<223> Von Heijne matrix 

score 6.30000019073486 

seq GFVSLLVVHAADA/WV

<400> 241 

tagctggacc cgtctgggag gtaggtttgt gagcgtgaga gaksgatctg taccgcgggg 60 
atccgaagta tgcttatcca ggtgggctgc ctcaagcctc gatcccaccc ccgcgctdvt 120 
gg atg gtg tca agg tcc ttg cgt ggg aga agg act tgg gtg aga tgc 167
Met Val Ser Arg Ser Leu Arg Gly Arg Arg Thr Trp Val Arg Cys
-45 -40 -35

atg cgg aga ttg ccc cca att ccg gcc tgg agc caa ggg aaa ggg atg 215
Met Arg Arg Leu Pro Pro Ile Pro Ala Trp Ser Gln Gly Lys Gly Met
-30 -25 -20 -15

cct gga ttt gtg tct cta ttg gtg gtc cac gct gcg gat gcc tgg gta 263
Pro Gly Phe Val Ser Leu Leu Val Val His Ala Ala Asp Ala Trp Val
-10 -5 1

gcc cag agg ttr tct acg cca tac ttc tca ctg ttt ttg agc ata cct 311
Ala Gln Arg Leu Ser Thr Pro Tyr Phe Ser Leu Phe Leu Ser Ile Pro
5 10 15

aga tgt tcc ttt cct agg cgg agt ata gat cgc acg tgt tct agc stc 359
Arg Cys Ser Phe Pro Arg Arg Ser Ile Asp Arg Thr Cys Ser Ser Xaa
20 25 30

tta gac tca gag ggt tcg agc tct ata asc ccc tcc act ccc ttc a 405
Leu Asp Ser Glu Gly Ser Ser Ser Ile Xaa Pro Ser Thr Pro Phe
35 40 45


<210> 242 
<211> 242 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 129. .242 

<221> sig_peptide 

<222> 129. .191 

<223> Von Heijne matrix 

score 6.30000019073486 

seq SLLPCSLISDCCA/SN 

<400> 242 

cttttgtttt gcaatgccct gcccccagag gtggagtcta cagaggcagg caggcctcct 60
tgagctgagg tgggctccac ccagttcgag cttcccagct gctttgttta cctactcaag 120
cctgggca atg gtg ggc gcc ctt ccc cca gcc tcg ctt ctg cct tgc agt 170
Met Val Gly Ala Leu Pro Pro Ala Ser Leu Leu Pro Cys Ser
-20 -15 -10

ttg atc tca gac tgc tgt gct agc aat gag cga ggc tcc atg ggc gta 218
Leu Ile Ser Asp Cys Cys Ala Ser Asn Glu Arg Gly Ser Met Gly Val
-5 1 5

gga ccc tct gag cca cgg cgy ggg 242
Gly Pro Ser Glu Pro Arg Arg Gly
10 15 

<210> 243 
<211> 363 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 298. .363 

<221> sig_peptide

<222> 298. .357 

<223> Von Heijne matrix 

score 6.30000019073486 

seq LGSLIASLAPSTG/LG 

<400> 243 

accactctga ggagacgcgt gacagataag aagggctggt gggatcagtc ctggtggtag 60 
ctcaggaagc agagcctgga gcatctccac tatggcctgg gctccactac ttctcaccct 120 
cctcgctcac tgcacaggtt cttgggccaa ctttatgctg actcagccgc actctgtgtc 180
ggagtcgccg gssgaagacg gtaaccatct cctgcacccg cagcagtggc agctttgtca 240
gcaactatgt tcagtggtac cagcggcgcc cggacagtgc ccccaccact gtgatct 297
atg agg atg aca aaa gac cct ctg ggg tct ctg atc gct tct ctg gct 345
Met Arg Met Thr Lys Asp Pro Leu Gly Ser Leu Ile Ala Ser Leu Ala
-20 -15 -10 -5

cca tcg aca ggt ctt ggg 363
Pro Ser Thr Gly Leu Gly 
1 

<210> 244 
<211> 324 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 153. .323 

<221> sig_peptide 

<222> 153..236

<223> Von Heijne matrix 

score 6.30000019073486 

seq FFLFLFFXEXXXX/XX 

<400> 244 

aattgatact gctttagatg tttctgtctc attttacaaa aatgtaagaa aaaagaaaaa 60 
tcaaactata ctgttaccta tttcttgtat attcttaaca gaatgttctg tacacataag 120 
tgtatgtgtg ttaatcctct tgtttaaatg cc atg aaa ctt cag ttt gcc ttt 173
Met Lys Leu Gln Phe Ala Phe
-25

tgt tat ttt ctt tat tta gat acc ttt ttt ctt ttt ctt ttt ttt ttk 221
Cys Tyr Phe Leu Tyr Leu Asp Thr Phe Phe Leu Phe Leu Phe Phe Xaa
-20 -15 -10

gag ama gyc tkg cyc kgt kgc hta ggm agg agt gca gtg gca maa cct 269
Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Arg Ser Ala Val Ala Xaa Pro
-5 1 5 10

cag ctc ayt gca gcc tcc acc ttc kgg tty caa gca att tty ctg ccc 317
Gln Leu Xaa Ala Ala Ser Thr Phe Xaa Phe Gln Ala Ile Phe Leu Pro
15 20 25

cag ckg g 324
Gln Xaa

<210> 245 
<211> 280 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 27. .278 

<221> sig_peptide 

<222> 27. .233 

<223> Von Heijne matrix 

score 6.30000019073486 

seq GILKVLLFSVVSG/LE

<400> 245 

gttgcggggc ggggccttcg cagagc atg gcg gcg ggc gag ctt gag ggt ggc 53 

Met Ala Ala Gly Glu Leu Glu Gly Gly 

-65

aaa ccc ctg agc ggg ctg ctg aat gcg ctg gcc cag gac act ttc cac 101
Lys Pro Leu Ser Gly Leu Leu Asn Ala Leu Ala Gln Asp Thr Phe His
-60 -55 -50 -45

ggg tac ccc ggc atc aca gag gag ctg cta cgg agc cag cta tat cca 149
Gly Tyr Pro Gly Ile Thr Glu Glu Leu Leu Arg Ser Gln Leu Tyr Pro
-40 -35 -30

gag gtg cca ccc gag gag ttc cac ccc ttt ctg gca aag atg agg ggg 197
Glu Val Pro Pro Glu Glu Phe His Pro Phe Leu Ala Lys Met Arg Gly
-25 -20 -15

att ctt aag gta ctg ctc ttt tct gta gtc tcc ggc ttg gag cag aac 245
Ile Leu Lys Val Leu Leu Phe Ser Val Val Ser Gly Leu Glu Gln Asn
-10 -5 1

ccc ttg gcc gct ggc ttc aga ctc tcc cac ccg gg 280
Pro Leu Ala Ala Gly Phe Arg Leu Ser His Pro
5 10 15


<210> 246 
<211> 211 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 70. .210 

<221> sig_peptide

<222> 70..162

<223> Von Heijne matrix 

score 6.30000019073486 

seq SLILSPSPRPVLG/FF 

<400> 246 

tttggctggg gagacccatc tggactacca aggagaagct atagactact tctactccac 60
caggaaggt atg atg atg tca aac gtg atg ctg atg cta cag tta cag ccc 111
Met Met Met Ser Asn Val Met Leu Met Leu Gln Leu Gln Pro
-30 -25 -20

ctg ctg gcg cas tct ctg att ctc tct ccc tct ccg cgt cca gtg ctg 159
Leu Leu Ala Xaa Ser Leu Ile Leu Ser Pro Ser Pro Arg Pro Val Leu
-15 -10 -5

ggc ttt ttc aga caa gtg cat ctc cta acc agg tca cat ttc agc cgc 207
Gly Phe Phe Arg Gln Val His Leu Leu Thr Arg Ser His Phe Ser Arg
1 5 10 15

tgg g 211
Trp



<210> 247 
<211> 359 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 249. .359 



<221> sig_peptide 

<222> 249. .308 

<223> Von Heijne matrix 

score 6.19999980926514 

seq LLFICPPPPPISA/SS 

<400> 247 

tttcagaatt ttgtgcagga atatctgagt atttctaatt agattagaat gtcagaatac 60
attcatggac atatgagggg ttttttaaaa ttttttttag atatccttca ccttgaacat 120
ttattatttc tttgtgttgg gaacaatcca aatctctcct agatgttttg aaatgtgcaa 180
tgtattgtta gctgtagtca ccctactgtg ctattgaata ctagagcttg ttccttctgt 240
ctaactgt atg att ata ctc att aac caa ctt ctc ttc atc tgt ccc cca 290
Met Ile Ile Leu Ile Asn Gln Leu Leu Phe Ile Cys Pro Pro
-20 -15 -10

cct cca ccc atc tca gcc tct agt aac tac cat ttt act ctc tac ctc 338
Pro Pro Pro Ile Ser Ala Ser Ser Asn Tyr His Phe Thr Leu Tyr Leu
-5 1 5 10

cat gac att aac ttt ttt agc 359
His Asp Ile Asn Phe Phe Ser
15 



<210> 248 
<211> 236 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 182. .235 



<221> sig_peptide

<222> 182. .226 

<223> Von Heijne matrix 

score 6.19999980926514 
seq DVLLQLLLRVCSP/RT 



<400> 248 

attgggttaa tttcactgca ctgactattt ttagatatat attctttgtg ccttcactag 60
aactcctctt acttcatgat atcttaacta taaaatcatc caaccatgaa aacaagcaca 120
caagaaacag aaacaaaaca gtcacaaaaa agcataaact gttagcattg atccatgatg 180
a atg act gat gta tta ctt caa ttg cta tta aga gtg tgt tct ccc agg 229
Met Thr Asp Val Leu Leu Gln Leu Leu Leu Arg Val Cys Ser Pro Arg
-15 -10 -5 1

acc agg g 236
Thr Arg

<210> 249
<211> 342 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 266. .340 



<221> sig_peptide

<222> 266..304

<223> Von Heijne matrix

score 6.19999980926514
seq MGLFLCCSLLIFC/LV



<400> 249 

taggctattt cttaattttc cttctaggat tcttatagtt tgaagtttta catttagatc 60 
gtcaatccat cttgagttca tttttgtata tgatgaaaag taggggtctg attttattct 120 
tctgcataag accagttatc ccagaaccgt ttgttgaata ggaagttctt ttctcattgc 180 
ttgtttgtgg ggactttgtc aaagatcaaa tagttatagg tgtgtggctg tatttcaggg 240 
tttctttatt ccatttcact gatct atg ggt ctg ttt ttg tgc tgc tct tta 292 

Met Gly Leu Phe Leu Cys Cys Ser Leu 
-10 -5 
ctg ata ttc tgt ctg gtt gtt cta atc ata act gaa ctg ggc tat ggg 340
Leu Ile Phe Cys Leu Val Val Leu Ile Ile Thr Glu Leu Gly Tyr Gly
1 5 10
gg 342 



<210> 250 
<211> 382 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 291. .380 



<221> sig_peptide

<222> 291. .332 

<223> Von Heijne matrix 

score 6.19999980926514 
seq GSWALTWLHPAEA/GT 

<221> misc_feature 
<222> 264..265,279..280
<223> n=a, g, c or t 



<400> 250 

atacagcggc ctctgacacc agcacagcaa acccgccggg atcaaagtgt accagtcggc 60
agcatgggta aggagagggg tttccaatca cccattgcct gctctgtctg cccctaattt 120
ggaaaggccc tcctccagaa aatgctagaa aacctgagtg gggagctggg gagggagtag 180
tggactctgc ttcattgtcc ccagtctgca caccccctcc cccaccaccc cactgcattt 240
cccagctcag ccaaactttc tgannaagac gggcagagnn ctgctgggag atg gga 296
Met Gly

tcc tgg gcc ctg act tgg ctc cat cca gca gag gct ggg acc agg gtg 344
Ser Trp Ala Leu Thr Trp Leu His Pro Ala Glu Ala Gly Thr Arg Val
-10 -5 1

cct ttc tgc agc tgg gaa aaa tca gat ggg cgc tct ta 382
Pro Phe Cys Ser Trp Glu Lys Ser Asp Gly Arg Ser
5 10 15



<210> 251
<211> 303
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 108. .302 

<221> sig_peptide 

<222> 108. .233 

<223> Von Heijne matrix 

score 6.19999980926514 

seq LSVLSLVINFSWS/RK 

<221> misc_feature
<222> 279 

<223> n=a, g, c or t 



<400> 251 

aaaagctgtg aggttgtaac tcagttcagt agtatttata aatatttgtt ttccactttt 60 
gtgcatatta tacaaatgat ggatataaaa tttgtttwga ccatwta atg atg ctt 116 

Met Met Leu 
-40 

rmw wwr rra aga gga tat cct cat aga act gaa cgt tat gat gga ttt 164 
Xaa Xaa Xaa Arg Gly Tyr Pro His Arg Thr Glu Arg Tyr Asp Gly Phe 

-35 -30 -25 

tta aaa tat tct gac cca aat gat att gca ttg tca gta ctg tcc ctg 212
Leu Lys Tyr Ser Asp Pro Asn Asp Ile Ala Leu Ser Val Leu Ser Leu
-20 -15 -10

gtt att aat ttc tcc tgg agt aga aaa tgc ttt gtt cct tac tat atc 260
Val Ile Asn Phe Ser Trp Ser Arg Lys Cys Phe Val Pro Tyr Tyr Ile
-5 1 5

cca ttt aaa cct tac cgv nta cct tac ccc acc gcg gcc cgg g 303
Pro Phe Lys Pro Tyr Arg Xaa Pro Tyr Pro Thr Ala Ala Arg
10 15 20 

<210> 252 
<211> 259 
<212> DNA 
<213> Homo sapiens



<220> 
<221> CDS 
<222> 106. .258 



<221> sig_peptide 

<222> 106. .222 

<223> Von Heijne matrix 

score 6.19999980926514 
seq CFVCXLFVFLLSG/LN 

<221> misc_feature 
<222> 134 

<223> n=a, g, c or t 



<400> 252 

atttaaaagg attttttaaa ggacctctat agttataagt cagcttaatt aaaaatggat 60
attccatagt catatttata tatatataca cacacatata tatgt atg tat gtg tgt 117
Met Tyr Val Cys

ata tat ata trt tta ana gac ctg tat gat ttt ttt ctt ctt gga act 165
Ile Tyr Ile Xaa Leu Xaa Asp Leu Tyr Asp Phe Phe Leu Leu Gly Thr
-35 -30 -25 -20

tat ttt ttt gag aga aag tgt ttt gtg tgt ktg ttg ttt gtt ttt ctt 213
Tyr Phe Phe Glu Arg Lys Cys Phe Val Cys Xaa Leu Phe Val Phe Leu
-15 -10 -5

ctc agt gga ctg aat tat ttc tcc att ctg tct ttt tac ccc cgg g 259
Leu Ser Gly Leu Asn Tyr Phe Ser Ile Leu Ser Phe Tyr Pro Arg
1 5 10



<210> 253 
<211> 165 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14. .163 



<221> sig_peptide 

<222> 14..133

<223> Von Heijne matrix 

score 6.19999980926514 
seq FITFIFSFSFCEC/IV 



<400> 253 

atattttata cac atg tgc ata ttc tgc ctt ttt cat tta cta tat cat 49
Met Cys Ile Phe Cys Leu Phe His Leu Leu Tyr His
-40 -35 -30

aaa ctt ctt tct aga tca tta ttt ttc tgc tgc att ttt tca gga ttt 97
Lys Leu Leu Ser Arg Ser Leu Phe Phe Cys Cys Ile Phe Ser Gly Phe
-25 -20 -15

atc acc ttt att ttt agt ttt agt ttt tgt gag tgt ata gta ggt atg 145
Ile Thr Phe Ile Phe Ser Phe Ser Phe Cys Glu Cys Ile Val Gly Met
-10 -5 1

tat att tat ggg gca cga ag 165
Tyr Ile Tyr Gly Ala Arg

5 10 



<210> 254 
<211> 328 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 207. .326 



<221> sig_peptide

<222> 207. .287 

<223> Von Heijne matrix 

score 6.19999980926514 
seq LLIFLYLSLNLFC/IF 



<400> 254 

aaacgttttc ttttatctaa tttatattaa atttaattaa aataagccag gaggctagtg 60 
gctaccatgt tagcagcaca gtcctatata ttctttcact tttgttacat ttgttatcaa 120
ttttaactac tattattatt acatacaata caattttaac aataggagat tgctattaga 180
tgaggcttta acagaaaarv attaav atg ara ata tgc tat aac att ttt caa 233
Met Xaa Ile Cys Tyr Asn Ile Phe Gln
-25 -20

aac att ctc ggc ctc ttg ctt att ttc ctg tat ctt tct ttg aat ctt 281
Asn Ile Leu Gly Leu Leu Leu Ile Phe Leu Tyr Leu Ser Leu Asn Leu
-15 -10 -5

ttt tgt att ttc ttt tct gtc cct gcc ctt caa cct aga aga ctg gg 328
Phe Cys Ile Phe Phe Ser Val Pro Ala Leu Gln Pro Arg Arg Leu
1 5 10



<210> 255 

<211> 320 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 233. .319 



<221> sig_peptide

<222> 233..310

<223> Von Heijne matrix

score 6.09999990463257
seq MLTLLGFPSKALT/FI

<221> misc_feature
<222> 129 

<223> n=a, g, c or t 



<400> 255 

caagttgtct cctgcgtagt gtctattagc tcttgaattt cttcaagatc catatactga 60 
aacacttcac tctccaactt ttttgccata ttgacaatca ctttcataat ttcacttatt 120 
gacyctgynw haaatcmtgt gaagyhatgc agahcatctg gacacagctt tctccagcag 180 
ggatyyatdg ttttgggctt gaaggggttt cacggctttt tctataacaa cg atg gca 238
Met Ala
-25

tct tca atg ctg waa tcc ttc cag act ttc atg atg ttg act cta ttg 286
Ser Ser Met Leu Xaa Ser Phe Gln Thr Phe Met Met Leu Thr Leu Leu
-20 -15 -10

ggt ttc cct tcc aaa gct ttg aca ttc att tcc a 320
Gly Phe Pro Ser Lys Ala Leu Thr Phe Ile Ser
-5 1 



<210> 256 
<211> 305 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 205. .303 



<221> sig_peptide

<222> 205. .264 

<223> Von Heijne matrix 

score 6.09999990463257 
seq LLSLPGSFIPGNC/RP 



<400> 256 

tttgttttat ttggttattt gttttgtttt gtttctctga ggccaatggg tgggaggaag 60 
tataaagaag tgtaaacagg aaagccagct gggcctggag ttccaagtgc ccatatttca 120
tcagcttcct ctccataact gtggcaggga cacttaaccc ttccctggct gtgagaagtt 180
attctctgag ggctggtgag caga atg gga aga tct aag agg cag ctc ctt 231
Met Gly Arg Ser Lys Arg Gln Leu Leu
-20 -15

tcc ttg cct ggt tcc ttt atc cct ggg aat tgc agg cca agg att ctg 279
Ser Leu Pro Gly Ser Phe Ile Pro Gly Asn Cys Arg Pro Arg Ile Leu
-10 -5 1 5

agc aat ggw gaa gwc aga agg aag gg 305
Ser Asn Gly Glu Xaa Arg Arg Lys
10

<210> 257 
<211> 181 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 37. .180 

<221> sig_peptide 

<222> 37 . .111 

<223> Von Heijne matrix 

score 6.09999990463257 

seq CFFLIFLLPPLPA/MI 

<400> 257 

tttctaatgc tattatcctg mtagtgamta agtctc atg aga tct gat ggg ttt 54 

Met Arg Ser Asp Gly Phe 
-25 -20 



atc agg ggt ttc tgc ttc tgc ttc ttc cta att ttt ctc ctg cca ccg 102
Ile Arg Gly Phe Cys Phe Cys Phe Phe Leu Ile Phe Leu Leu Pro Pro
-15 -10 -5

ctt cct gcc atg ata ctg agg cct ctg cag cca tgt gga att ata agt 150
Leu Pro Ala Met Ile Leu Arg Pro Leu Gln Pro Cys Gly Ile Ile Ser
1 5 10

cca att aaa cct ctt ttt cct ttt ttt ttt t 181
Pro Ile Lys Pro Leu Phe Pro Phe Phe Phe
15 20




<210> 258 
<211> 236 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 119. .235 

<221> sig_peptide 

<222> 119. .166 

<223> Von Heijne matrix 

score 6.09999990463257 

seq LWTASSLPLSTHS/QR 

<400> 258 

caaaaaaatc agtctttaag catttgcttg gtaaggtttc ttaagattag gtttataata 60 
caaccatctg taatgtatct stcgtttgag cttgtgggcc atacaattca ttaactag 118 
atg aat aca ttg tgg aca gca tcc tca cta ccc ctc tct act cac tca 166
Met Asn Thr Leu Trp Thr Ala Ser Ser Leu Pro Leu Ser Thr His Ser
-15 -10 -5

caa aga acc atg ata cac tgg aat gtt ttt ctc tgg aat tct ttc tac 214
Gln Arg Thr Met Ile His Trp Asn Val Phe Leu Trp Asn Ser Phe Tyr
1 5 10 15

tct tgt att aaa att ttt ccc c 236
Ser Cys Ile Lys Ile Phe Pro
20 



<210> 259 
<211> 265 
<212> DNA
<213> Homo sapiens

<220> 
<221> CDS 
<222> 128. .265 



<221> sig_peptide

<222> 128. .220 

<223> Von Heijne matrix 

score 6.09999990463257 
seq CLIGLLVPLLGWG/NQ 



<400> 259

gacttaggat ttgagcatct ttctgttatg ctgttgcccc actcctattg caatactccc 60
cttcttaaga aagtttttct agactaatgt ctagattaaa cttcttttct ttgacaataa 120
tgatgcc atg act tgg aca aaa tgc cca ttg cct ctg ggt cct gct ttc 169
Met Thr Trp Thr Lys Cys Pro Leu Pro Leu Gly Pro Ala Phe
-30 -25 -20

ttc acc cag tgc tgc ctt att gga ctc ctt gtg cct ctc ctt ggc tgg 217
Phe Thr Gln Cys Cys Leu Ile Gly Leu Leu Val Pro Leu Leu Gly Trp
-15 -10 -5

gga aat cag aat aca cag tgg tat ccc act tct aag atg cct gat ggg 265
Gly Asn Gln Asn Thr Gln Trp Tyr Pro Thr Ser Lys Met Pro Asp Gly
1 5 10 15

<210> 260 
<211> 272 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 162. .272 



<221> sig_peptide

<222> 162. .257 

<223> Von Heijne matrix 

score 6.09999990463257 
seq IVYFLVLLRVLYT/LQ 



<400> 260 

cacaaggttg atttaaaatt cttaaaaaat ttttcaaaat ctttccaaat gaaacaagat 60 
ttattgttaa tctacagaaa tatcctccat tcactttgat atttaaatga catcgtacat 120 
tttaggtaga gcatttttat gaccactcat tgcttagtct g atg ggg agg agc aat 176
Met Gly Arg Ser Asn
-30

gat ttt agg ttt gcc ttt cta aca tgc ttt ctt gga tgg gaa ata gta 224
Asp Phe Arg Phe Ala Phe Leu Thr Cys Phe Leu Gly Trp Glu Ile Val
-25 -20 -15

tat ttc ttg gtg ctt ctt cgt gtt tta tac act tta caa tgg ggt ggg 272
Tyr Phe Leu Val Leu Leu Arg Val Leu Tyr Thr Leu Gln Trp Gly Gly
-10 -5 1 5

<210> 261 

<211> 98 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 26..97

<221> sig_peptide
<222> 26..79

<223> Von Heijne matrix 

score 6.09999990463257 

seq LTSFLTYMPLISS/SC 

<400> 261 

tttctaggta tacaatcata ttata atg aaa aca gat aat ttg act tct ttt 52
Met Lys Thr Asp Asn Leu Thr Ser Phe
-15 -10

ctt aca tat atg cct ctt att tct tcc tct tgc tca att gct ccc t 98
Leu Thr Tyr Met Pro Leu Ile Ser Ser Ser Cys Ser Ile Ala Pro
-5 1 5

<210> 262 
<211> 419 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 28. .417 

<221> sig_peptide

<222> 28. .264 

<223> Von Heijne matrix 

score 6.09999990463257 

seq ATVAVLSFILSSA/AK 

<400> 262 

attcccgggc cctggcttct tggcgcg atg agg ttc cgg ttc tgt ggt gat ctg 54
Met Arg Phe Arg Phe Cys Gly Asp Leu
-75

gac tgt ccc gac tgg gtc ctg gca gaa atc agc acg ctg gcc aag atg 102
Asp Cys Pro Asp Trp Val Leu Ala Glu Ile Ser Thr Leu Ala Lys Met
-70 -65 -60 -55

tcc tct gtg aag ttg cgg ctg ctc tgc agc cag gta cta aag gag ctg 150
Ser Ser Val Lys Leu Arg Leu Leu Cys Ser Gln Val Leu Lys Glu Leu
-50 -45 -40

ctg gga cag ggg att gat tat gag aag atc ctg aag ctc acg gct gac 198
Leu Gly Gln Gly Ile Asp Tyr Glu Lys Ile Leu Lys Leu Thr Ala Asp
-35 -30 -25

gcc aag ttt gag tca ggc gat gtg aag gcc aca gtg gca gtg ctg agt 246
Ala Lys Phe Glu Ser Gly Asp Val Lys Ala Thr Val Ala Val Leu Ser
-20 -15 -10

ttc atc ctc tcc agt gcg gcc aag cac agt gtc gat ggc gaa tcc ttg 294
Phe Ile Leu Ser Ser Ala Ala Lys His Ser Val Asp Gly Glu Ser Leu
-5 1 5 10

tcc agt gaa ctg cag cag ctg ggg ctg ccc aaa gag cac gcg gcc agc 342
Ser Ser Glu Leu Gln Gln Leu Gly Leu Pro Lys Glu His Ala Ala Ser
15 20 25

ctg tgc cgc tgt tat gag gag aag caa agc ccc ttg cag aag cac ttg 390
Leu Cys Arg Cys Tyr Glu Glu Lys Gln Ser Pro Leu Gln Lys His Leu
30 35 40

cgg gtc tgc agc cta cgc atg aat agg tt 419
Arg Val Cys Ser Leu Arg Met Asn Arg
45 50




<210> 263 
<211> 371 
<212> DNA 

<213> Homo sapiens 



<220>
<221> CDS
<222> 133..369

<221> sig_peptide

<222> 133..174

<223> Von Heijne matrix 

score 6.09999990463257 
seq FLAALFTVAKIWK/QP 



<400> 263 

cactatggag aactgtatgg cgggtcctca aaaaactaaa aatagaactc ccatatgatc 60 
cagcaatccc attgctaggt atataccccc ccaaaaaagg aaatcagtat atgaaagaga 120 
tatctgaatc cc atg ttt ctt gca gca ctg ttt aca gta gct aag att tgg 171
Met Phe Leu Ala Ala Leu Phe Thr Val Ala Lys Ile Trp
-10 -5

aag caa cct aag tgt tca tca aca aac aaa tgg aca aag aaa atg tgg 219
Lys Gln Pro Lys Cys Ser Ser Thr Asn Lys Trp Thr Lys Lys Met Trp
1 5 10 15

tac ata tac aca atg gag tac tat tca gcc ata aaa aaa gat gat atc 267
Tyr Ile Tyr Thr Met Glu Tyr Tyr Ser Ala Ile Lys Lys Asp Asp Ile
20 25 30

ctg tca ttt gca aca ata tgg atg gaa ctg gag agc att aca tta agt 315
Leu Ser Phe Ala Thr Ile Trp Met Glu Leu Glu Ser Ile Thr Leu Ser
35 40 45

gaa ata agt ggg sca cca aaa gac aaa ctt ctc atg ttc tca ctc att 363
Glu Ile Ser Gly Xaa Pro Lys Asp Lys Leu Leu Met Phe Ser Leu Ile

50 55 60 

tgt gga ag 371 
Cys Gly 
65 



<210> 264 
<211> 283 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 194..283

<221> sig_peptide

<222> 194..274

<223> Von Heijne matrix 

score 6.09999990463257 
seq LSILQSLVPAAGA/XS 



<400> 264 

ctcattccct gtcctcggat cacagtctct tctcactaca gtgtcgccgc ctctgcctgc 60
gtascccggc catggctctg tagcctcgac ccctttgtgc ccccggcccg tctccgcgct 120
caccacgcct gcgctctccg ctcccacctt ctttcttcag ccgaggccgc cgccgcctct 180
ccttgctgca gcc atg gag tct tcc act ttc gcc ttg gtg cct gtc ttc 229
Met Glu Ser Ser Thr Phe Ala Leu Val Pro Val Phe
-25 -20
gcc cac ctg agc atc ctc cag agc ctc gtg cca gct gct ggt gca gyc 277
Ala His Leu Ser Ile Leu Gln Ser Leu Val Pro Ala Ala Gly Ala Xaa
-15 -10 -5 1 

tct cct 283 
Ser Pro 



<210> 265 
<211> 370 
<212> DNA 

<213> Homo sapiens

<220>
<221> CDS 
<222> 117. .368 

<221> sig_peptide 

<222> 117. .350 

<223> Von Heijne matrix 

score 6.09999990463257 

seq LLWFLQTFFFGIA/SL 

<400> 265 

aaagcgcgct cccggggagg tgttgcagcc atggctacgg cagccggcgc gacctacttt 60 
cagcgaggca gtctgttctg gttcacagtc atcaccctca gctttggcta ctacac atg 119 

Met

ggt tgt ctt ctg gcc tca gag tat ccc tta tca gaa cct tgg gcc cct 167
Gly Cys Leu Leu Ala Ser Glu Tyr Pro Leu Ser Glu Pro Trp Ala Pro
-75 -70 -65

ggg ccc ttc act cag tac ttg gtg gac cac cat cac acc ctc ctg tgc 215
Gly Pro Phe Thr Gln Tyr Leu Val Asp His His His Thr Leu Leu Cys
-60 -55 -50

aat ggg tat tgg ctt gcc tgg ctg att cat gtg gga gag tcc ttg tat 263
Asn Gly Tyr Trp Leu Ala Trp Leu Ile His Val Gly Glu Ser Leu Tyr
-45 -40 -35 -30

gcc ata gta ttg tgc aag cat aaa ggc atc aca agt ggt cgg gct cag 311
Ala Ile Val Leu Cys Lys His Lys Gly Ile Thr Ser Gly Arg Ala Gln
-25 -20 -15

cta ctc tgg ttc cta cag act ttc ttc ttt ggg ata gcg tct ctc asc 359
Leu Leu Trp Phe Leu Gln Thr Phe Phe Phe Gly Ile Ala Ser Leu Xaa
-10 -5 1

atc ttg att gc 370
Ile Leu Ile
5


<210> 266 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 178. .273 

<221> sig_peptide 

<222> 178. .225 

<223> Von Heijne matrix 

score 6.09999990463257 

seq WIWVASILLRIFA/SV 

<400> 266 

tatcgtgaaa gaatattgaa ttttatcaaa tctttttttg tatctgttga gatgattaca 60
tggttattat ccttcattct gttgatgtga tgtatcacat ttattgattt gcatatgttg 120
aaccctcctt gcatccctgg aatgattcct acttcattat agtgtataat ctttttg 177
atg tgc tgt tgg att tgg gtt gct agt att ttg ttg aga att ttt gca 225
Met Cys Cys Trp Ile Trp Val Ala Ser Ile Leu Leu Arg Ile Phe Ala
-15 -10 -5

tct gtg tta atc agg gat att tac ctg tgg ttt tct ttt ttt ttt ttt t 274
Ser Val Leu Ile Arg Asp Ile Tyr Leu Trp Phe Ser Phe Phe Phe Phe
1 5 10 15



<210> 267 
<211> 342 
<212> DNA 




<213> Homo sapiens 






<220> 
<221> CDS 
<222> 232. .342 

<221> sig_peptide

<222> 232. .300 

<223> Von Heijne matrix 

score 6.09999990463257 
seq LLFFXLWLRYKES/GR 

<221> misc_feature 
<222> 158 

<223> n=a, g, c or t 



<400> 267 

caagttatct caatttcttc tgagaagaaa tatagtttca aaatcaatca ataaagataa 60 
tcctctgata aagtaagatc tgaatataca aatcatggtt acagtaatct taccattata 120
tataaattac ctctcaaaca aatgggccat tcagaarnrg gctcagagtg aattagctgg 180
aggggttgtc aagggtcata gtttttactg ctttgaagag attatcactg g atg att 237
                                                         Met Ile

tcc tca cat tta tat aac ttc agt ctc ctg ttc ttt kta ctc tgg ctg 285
Ser Ser His Leu Tyr Asn Phe Ser Leu Leu Phe Phe Xaa Leu Trp Leu
    -20                 -15                 -10

agg tac aag gaa tca gga aga gag ggc aac tgt gag gaa gga gca ttc 333
Arg Tyr Lys Glu Ser Gly Arg Glu Gly Asn Cys Glu Glu Gly Ala Phe
-5                  1               5                   10

tcc agg tgg 342
Ser Arg Trp

<210> 268 
<211> 427 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 62 . .427 

<221> sig_peptide 

<222> 62 . . 112 

<223> Von Heijne matrix 

score 6.09999990463257 

seq RLLLLPRPPASTG/AS 

<400> 268 

agttgagtgg aaatgggcaa cggcgggcgg agcggcctgc agcaggggaa ggggaacgtg 60
g atg ggg tgg cag cga ctc cta ctg ctg cct cgg cct cct gcc agt aca 109
  Met Gly Trp Gln Arg Leu Leu Leu Leu Pro Arg Pro Pro Ala Ser Thr
          -15                 -10                 -5

ggt gca tcg aat gca acc agg rrg cca aag agk ttg tac cga grc tat 157
Gly Ala Ser Asn Ala Thr Arg Xaa Pro Lys Xaa Leu Tyr Arg Xaa Tyr
    1               5                   10                  15

aac cac ggt gtg ctg aag ata acc atc tgt aaa tcc tgc cag aaa cct 205
Asn His Gly Val Leu Lys Ile Thr Ile Cys Lys Ser Cys Gln Lys Pro
                20                  25                  30

gta gac aaa tat atc gag tat gat cct gtt atc atc ttg awk aat gct 253
Val Asp Lys Tyr Ile Glu Tyr Asp Pro Val Ile Ile Leu Xaa Asn Ala
            35                  40                  45

ata ttg tgc aaa gct cad gcc tac agr cat att ctt ttc aat act caa 301
Ile Leu Cys Lys Ala Xaa Ala Tyr Arg His Ile Leu Phe Asn Thr Gln
        50                  55                  60

ata aat aac aaa ctg cct att tta ttg gca ttt tta cct tcc tgt ggv 349
Ile Asn Asn Lys Leu Pro Ile Leu Leu Ala Phe Leu Pro Ser Cys Gly
    65                  70                  75

dga acg gcc cat gac ggc aaa aaa aag ccc aac ttc att ttg ctg ctg 397
Xaa Thr Ala His Asp Gly Lys Lys Lys Pro Asn Phe Ile Leu Leu Leu
80                  85                  90                  95

aaa sat tat tat tat cta gct acg gaa aac 427
Lys Xaa Tyr Tyr Tyr Leu Ala Thr Glu Asn
                100                 105

<210> 269 
<211> 143 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 20. .142 

<221> sig_peptide 
<222> 20. .76 
<223> Von Heijne matrix 
score 6 

seq LLALVVRVILSTA/IL
<400> 269

ctttctttgc ggaatcacc atg gcg gct ggg gta agt ttg ctg gct ctg gtg 52
                     Met Ala Ala Gly Val Ser Leu Leu Ala Leu Val
                                     -15                 -10
gtt cgg gtc atc cta tcc acc gcc atc ctt tgc ccg agt ggg gcc agt 100
Val Arg Val Ile Leu Ser Thr Ala Ile Leu Cys Pro Ser Gly Ala Ser
            -5                  1               5
cgg cgc cag agg agt tct gag gtt gag tgg gga act gat tcg g 143
Arg Arg Gln Arg Ser Ser Glu Val Glu Trp Gly Thr Asp Ser
    10                  15                  20

<210> 270 
<211> 79 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23. .79 

<221> sig_peptide 
<222> 23. .67 
<223> Von Heijne matrix 
score 6 

seq PLFWLILCSGLLC/NK 

<221> misc_feature 
<222> 2. .3 

<223> n=a, g, c or t






<400> 270 

tnngctaatc ttgcttgtac tt atg aat cct tta ttc tgg ttg att ctc tgc 52
                         Met Asn Pro Leu Phe Trp Leu Ile Leu Cys
                         -15                 -10

tct ggg tta tta tgt aac aag tca ttt 79
Ser Gly Leu Leu Cys Asn Lys Ser Phe
-5                  1



<210> 271 
<211> 121 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 61 . .120 



<221> sig_peptide
<222> 61. .114 
<223> Von Heijne matrix 
score 6 

seq ISIFLSSLSLSLS/LF 



<400> 271 

cttccttaag aagcggtttc tctcccctct tttctctctc tcacctgggt ttgtttgtcc 60 
atg aga ggg gct tgg ata agt ata ttt ctt tct tct cta tct ctc tct 108
Met Arg Gly Ala Trp Ile Ser Ile Phe Leu Ser Ser Leu Ser Leu Ser
            -15                 -10                 -5

ctc tct ctt ttt t 121
Leu Ser Leu Phe
        1



<210> 272 
<211> 292 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 196..291



<221> sig_peptide 
<222> 196 . .267 
<223> Von Heijne matrix 
score 6 

seq LFVLLPHFFLSFL/SP



<400> 272 

ctcatggact gtggctgtct tattttatgt ctctaatacc agattatgaa aatcacagaa 60 
aaaaggaaaa aatattattt ccaaagagta agttatgaag ccatgttaga aacccatatg 120 
acaatatgaa tttcttttat ctgtcaatct caaggtagaa ttcctcatat ttctgataat 180 
gccaaatacc atgaa atg tct caa aaa aga ctt gac ttt ata tac cag ttg 231 
Met Ser Gln Lys Arg Leu Asp Phe Ile Tyr Gln Leu
-20 -15 
ttt gtc ttg ctg cct cac ttc ttc ctt tct ttt ctt tct ccc ttt tat 279 
Phe Val Leu Leu Pro His Phe Phe Leu Ser Phe Leu Ser Pro Phe Tyr 

-10 -5 1 

ctg cac cca tgg g 292 
Leu His Pro Trp 
5 



<210> 273 
<211> 158 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2. .157 



<221> sig_peptide
<222> 2 . .100 
<223> Von Heijne matrix 
score 6 

seq LAHFLIGLTVCFG/EG 



<400> 273 

c atg tac ctg tac ctg ttg tcc att tgt atg tct tct ttg aag aaa tgt 49
  Met Tyr Leu Tyr Leu Leu Ser Ile Cys Met Ser Ser Leu Lys Lys Cys
              -30                 -25                 -20

cta ttc aag ttc tta gcc cac ttt tta atc ggg tta aca gtt tgt ttt 97
Leu Phe Lys Phe Leu Ala His Phe Leu Ile Gly Leu Thr Val Cys Phe
        -15                 -10                 -5

ggt gag ggr wgg cta atg agt tat agg agt tct tat tta tta ctt aaa 145
Gly Glu Gly Xaa Leu Met Ser Tyr Arg Ser Ser Tyr Leu Leu Leu Lys
    1               5                   10                  15

gga cca ccg ggg g 158 
Gly Pro Pro Gly 

<210> 274 
<211> 113 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31. .111 

<221> sig_peptide 
<222> 31. .96 
<223> Von Heijne matrix 
score 6 

seq CLLVFLLTEWTSS/KL 
<400> 274 

cctttggtct ttgatgatgg tgacatacag atg ggg ttt tgg tgt gaa tgt cct 54 

Met Gly Phe Trp Cys Glu Cys Pro 
-20 -15 
ttc tgt ttg tta gtt ttc ctt cta aca gag tgg acc tct agc aaa ctc 102
Phe Cys Leu Leu Val Phe Leu Leu Thr Glu Trp Thr Ser Ser Lys Leu
                -10                 -5                  1

caa aag acg gg 113
Gln Lys Thr
        5

<210> 275 
<211> 254 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 182. .253 



<221> sig_peptide 
<222> 182. .247 
<223> Von Heijne matrix 
score 6 

seq VLHLFPLTPASTG/HW 



<400> 275 

cccatctgcc tgctcccact aggggtctga gtagcaggca ccgaagaagt gagccacgcc 60 
ctcttcacac accctttgag gaggacaagg gaacttttcc tgtttcagaa agttgtgttg 120
agaagaatgg caaggctaac agggcaggtg tccgggcgga ggggcggaac tggctgttgg 180 
c atg tgg tgg ggg aga tgc ttc atc cgg gtc ttg cat ttg ttc act ctg 229
  Met Trp Trp Gly Arg Cys Phe Ile Arg Val Leu His Leu Phe Pro Leu
          -20                 -15                 -10

aca cca gcc tcg aca gga cac tgg g 254
Thr Pro Ala Ser Thr Gly His Trp
    -5                  1

<210> 276 
<211> 364 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 189. .362 

<221> sig_peptide
<222> 189. .275 
<223> Von Heijne matrix 
score 6 

seq LFMALPPVLSSHG/SR 
<400> 276 

acttgcactt gcttgttggt gtcagagccc gtccctaaac cagggctcca tatgggctgc 60
ctgtctgccg caacacagcc tagcggggaa acagtagaaa tgccacttct atgtatttat 120
catatttatt ttgagataat taacgaagac gttaaataaa gccagactgc actgacccct 180
ggggcgcc atg cga gac ccc ctc gcg gac atg gta cac agt tat tta tca 230
         Met Arg Asp Pro Leu Ala Asp Met Val His Ser Tyr Leu Ser
                         -25                 -20
tcg tct ttg ttc atg gcc ctt cca cca gtg ctg agc tca cat ggc agc 278
Ser Ser Leu Phe Met Ala Leu Pro Pro Val Leu Ser Ser His Gly Ser
-15                 -10                 -5                  1

agg aac ctg aga atc tgg ggg agt cca ttt ggt gga gcg ctg act aag 326
Arg Asn Leu Arg Ile Trp Gly Ser Pro Phe Gly Gly Ala Leu Thr Lys
            5                   10                  15

ggc aaa gca ccc cca acc cca gca caa cca gcc ctg gg 364
Gly Lys Ala Pro Pro Thr Pro Ala Gln Pro Ala Leu
        20                  25

<210> 277 

<211> 130 

<212> DNA

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 46. .129 

<221> sig_peptide 
<222> 46. .96 
<223> Von Heijne matrix 
score 6 

seq WLCLPCSLCVSQL/LP 
<400> 277 

gtctttgcag gmvtgtttgt gctccaacag ggagctgagt ttgtc atg agc agt gcc 57
                                                  Met Ser Ser Ala
                                                          -15

tgg ctg tgt ctg cca tgc tcc ctg tgt gtg tcc cag ctc ctt ccc tct 105
Trp Leu Cys Leu Pro Cys Ser Leu Cys Val Ser Gln Leu Leu Pro Ser
            -10                 -5                  1
tat tcc ctg ttg atc cca gcc ccg g 130
Tyr Ser Leu Leu Ile Pro Ala Pro
    5                   10

<210> 278 

<211> 184 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 103..183

<221> sig_peptide
<222> 103..165
<223> Von Heijne matrix 

score 5.90000009536743 

seq LSLLGPLXPPMRA/CS 

<400> 278 

cattatgttg acatttctag ctacaaggcc agtattttac aaaataaggc cttttccctt 60
aattaaggtt gtgacagata aaagtatatt cccagctgac tc atg tca ccc atg 114
                                               Met Ser Pro Met
                                                   -20

tgg gca ggc cta tta tcc cta ctt ggc ccg ctc wgt ccg cct atg agg 162
Trp Ala Gly Leu Leu Ser Leu Leu Gly Pro Leu Xaa Pro Pro Met Arg
        -15                 -10                 -5

gct tgc tct gtg tgc gta ctc t 184
Ala Cys Ser Val Cys Val Leu
    1               5

<210> 279 
<211> 265 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 149. .265 

<221> sig_peptide 
<222> 149. .202 
<223> Von Heijne matrix 

score 5.90000009536743 

seq LSIADLLPSSSFA/NP 

<400> 279 

tgagcatttc ccaacatatc tgttgagttt ttaactcttt ttatgatctt ttttatttct 60
aggagttcta tttggttctt ttccaagtca gctatgtcat gttttaawag tttccwgtcc 120
ttdgcrrtwc twttctawct tgaggttt atg tct tta aac gag tta agc ata 172
                               Met Ser Leu Asn Glu Leu Ser Ile
                                           -15

gct gat tta tta ccc agc tca tcc ttt gct aat ccc aag ctg agt ggg 220
Ala Asp Leu Leu Pro Ser Ser Ser Phe Ala Asn Pro Lys Leu Ser Gly
-10                 -5                  1               5

ccg att tct atc tcg gtc act tca gct ggt tct cct ccc ggg gcg 265
Pro Ile Ser Ile Ser Val Thr Ser Ala Gly Ser Pro Pro Gly Ala
            10                  15                  20

<210> 280 
<211> 188 
<212> DNA 
<213> Homo sapiens 



<220>
<221> CDS 
<222> 110. .187 



<221> sig_peptide

<222> 110. .154 

<223> Von Heijne matrix 

score 5.90000009536743 
seq DLLGTAFLEGSLA/AY 



<400> 280 

taataataat aataataaat ttttctgtta gattagtaga tgttaagatt atggacaaaa 60 
tccaacgtag attggagtat agagaagtgg acctttctgt gtggtcttg atg aaa gac 118 

Met Lys Asp 
-15 

tta ctt ggc act gcc ttt ctg gag gga agt tta gca gca tat ctc acc 166
Leu Leu Gly Thr Ala Phe Leu Glu Gly Ser Leu Ala Ala Tyr Leu Thr
        -10                 -5                  1

atg gcc aat ata acc cat gtg g 188
Met Ala Asn Ile Thr His Val
5                   10

<210> 281 
<211> 177 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 91. .177 



<221> sig_peptide

<222> 91. .147 

<223> Von Heijne matrix 

score 5.90000009536743 
seq HLFMCLLTICISS/LE 



<400> 281 

gccaacttgt tttttgctgt agttttgatt agagccattc tactgggtgt gaagtgatat 60 
tttgatgttg ttttgatttg catttccttg atg gct aat gac att aag cat ctt 114
                                 Met Ala Asn Asp Ile Lys His Leu
                                                 -15

ttc atg tgc tta ttg acc ata tgt ata tct tct ttg gag aaa ctt cca 162
Phe Met Cys Leu Leu Thr Ile Cys Ile Ser Ser Leu Glu Lys Leu Pro
    -10                 -5                  1               5

ttc ttt ttt ttt ttt 177
Phe Phe Phe Phe Phe
                10



<210> 282 
<211> 336 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 42. .335 



<221> sig_peptide 

<222> 42. .113 

<223> Von Heijne matrix 

score 5.90000009536743 
seq ATLVGFTVGSVLG/QI



<400> 282 

ttgcctatta ctcttatatc tacagtgtgg tggacctggg c atg tac cag aaa gtc 56
                                              Met Tyr Gln Lys Val
                                                              -20

aca agt tac tgt cga agt gcc act tta gta ggc ttt aca gtg ggc tct 104
Thr Ser Tyr Cys Arg Ser Ala Thr Leu Val Gly Phe Thr Val Gly Ser
                -15                 -10                 -5

gtc cta ggg caa atc ctt gtc tca gtg gca ggc tgg tcg ctg ttc agc 152
Val Leu Gly Gln Ile Leu Val Ser Val Ala Gly Trp Ser Leu Phe Ser
            1               5                   10

ctg aat gtc atc tct ctt acc tgt gtt tca gtg gct ttt gct gtg gcc 200
Leu Asn Val Ile Ser Leu Thr Cys Val Ser Val Ala Phe Ala Val Ala
    15                  20                  25

tgg ttt tta cct atg cca cag aag agc ctc ttc ttt cac cac att cct 248
Trp Phe Leu Pro Met Pro Gln Lys Ser Leu Phe Phe His His Ile Pro
30                  35                  40                  45

tct acc tgc cag aga gtg aat ggc atc aag gta caa aat ggt ggc att 296
Ser Thr Cys Gln Arg Val Asn Gly Ile Lys Val Gln Asn Gly Gly Ile
                50                  55                  60

gtt act gac acc cag ctt cta aca cct tcc tgg ctg gga g 336
Val Thr Asp Thr Gln Leu Leu Thr Pro Ser Trp Leu Gly
            65                  70



<210> 283 
<211> 294 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 238. .294 



<221> sig_peptide

<222> 238. .288 

<223> Von Heijne matrix 

score 5.90000009536743 
seq ALFFLLRIAWLLG/LF

<221> misc_feature 
<222> 227 

<223> n=a, g, c or t 



<400> 283 

acatacacgt caattaatct gattcatccc ataaacaaaa ctaaagataa aaaccatgtg 60
attatctcaa tatatgcaga aaaggctttc aataaaattc aaggcctctc catattaaaa 120
actctaaaaa atctgggtat tgaggaarca tagctcaaaa gtgatgrgct gtttttgtac 180
cagtatcatg ctgttttggt tactgtagcc ctgtagtata gtttgangtt gggtaac 237
atg atg cct cca gct ttg ttc ttt ttg ctg agg att gct tgg cta tta 285
Met Met Pro Pro Ala Leu Phe Phe Leu Leu Arg Ile Ala Trp Leu Leu
        -15                 -10                 -5

ggg ctc ttt 294
Gly Leu Phe
    1



<210> 284 
<211> 203 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS
<222> 90. .203 



<221> sig_peptide 

<222> 90. .152 

<223> Von Heijne matrix 

score 5.90000009536743 
seq ALSLWASVSPSWM/CR



<400> 284 

catctttcgg cctagatgga ggaaccgtgt gctggctggg caggctgctg gcagaggtca 60 
ggagggctct tccctgagcc ctgccatcc atg aac tgt gta act ttg atc cag 113
                                Met Asn Cys Val Thr Leu Ile Gln
                                    -20                 -15
gcc ttg tcc ctc tgg gcc tca gtt tcc cca agc tgg atg tgt cgt ccc 161
Ala Leu Ser Leu Trp Ala Ser Val Ser Pro Ser Trp Met Cys Arg Pro
            -10                 -5                  1

act gct tca ttc ata atc acc acc acc acc acc acc tgc ggg 203
Pro Ala Ser Phe Ile Ile Thr Thr Thr Thr Thr Thr Cys Gly
    5                   10                  15



<210> 285 
<211> 297 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 240. .296 



<221> sig_peptide 

<222> 240. .287 

<223> Von Heijne matrix 

score 5.90000009536743 
seq LLSLMARTDLVFC/SP 

<221> misc_feature 
<222> 107 

<223> n=a, g, c or t 



<400> 285 

aggcattgtc taggctgctg ggcacatgag ctccgggatg cccatgtcct ctggccaggc 60 
agacacagac ctggggcagc accagctttc tgatggcagc ctgctcnttc caacagttcc 120
ctaccagaat cctgcctcac tggagcagag gatgccagca tcagccggga accactcctg 180
tgctaaaacc gccttggtgg cctgtggctt gaggtcttga tgcggatgaa gccggagga 239
atg ttg tct ctc ctc agt ctc atg gca agg act gat ctt gtt ttc tgt 287
Met Leu Ser Leu Leu Ser Leu Met Ala Arg Thr Asp Leu Val Phe Cys
    -15                 -10                 -5

tcc cca cgg g 297
Ser Pro Arg
1



<210> 286 

<211> 774 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 8. .772 



<221> sig_peptide
<222> 8..109

<223> Von Heijne matrix 

score 5.90000009536743 
seq MAVLAPLIALVYS/VP 

<221> misc_feature

<222> 486,565 

<223> n=a, g, c or t 



<400> 286 



agtcgtt atg gtg ggg gag gcg ggg cga gac cta cga cgc cgg cga gca 49
        Met Val Gly Glu Ala Gly Arg Asp Leu Arg Arg Arg Arg Ala
                        -30                 -25

gtg gcc gtt acg gcc gaa aag atg gcg gtc tta gca cct cta att gct 97
Val Ala Val Thr Ala Glu Lys Met Ala Val Leu Ala Pro Leu Ile Ala
-20                 -15                 -10                 -5

ctc gtg tat tcg gtg ccg cga ctt tca cga ... ctc gcc caa cct tac 145
Leu Val Tyr Ser Val Pro Arg Leu Ser Arg ... Leu Ala Gln Pro Tyr
                1               5                   10

tac ctt ctg tcg gcc ctg ctc tct gct gcc ttc cta ctc gtg agg aaa 193
Tyr Leu Leu Ser Ala Leu Leu Ser Ala Ala Phe Leu Leu Val Arg Lys
        15                  20                  25

ctg ccg ccg ctc tgc cac ... ctg ccc acc caa cgc gaa gac ggt aac 241
Leu Pro Pro Leu Cys His Gly Leu Pro Thr Gln Arg Glu Asp Gly Asn
    30                  35                  40

ccg tgt gac ttt gac tgg aga gaa gtg gag atc ctg atg ttt ctc agt 289
Pro Cys Asp Phe Asp Trp Arg Glu Val Glu Ile Leu Met Phe Leu Ser
45                  50                  55                  60

gcc att gtg atg atg aag aac cgc aga tcc atc act gtg gag caa cat 337
Ala Ile Val Met Met Lys Asn Arg Arg Ser Ile Thr Val Glu Gln His
                65                  70                  75

ata ggc aac att ttc atg ttt agt aaa ... gcc aac aca att ctt ttc 385
Ile Gly Asn Ile Phe Met Phe Ser Lys Val Ala Asn Thr Ile Leu Phe
            80                  85                  90

ttc cgc ttg gat att cgc atg ggc cta ctt tac atc aca ctc tgc ata 433
Phe Arg Leu Asp Ile Arg Met Gly Leu Leu Tyr Ile Thr Leu Cys Ile
        95                  100                 105

gtg ttc ctg atg acg tgc aaa ccc ccc cta tat atg ggc cct gag tat 481
Val Phe Leu Met Thr Cys Lys Pro Pro Leu Tyr Met Gly Pro Glu Tyr
    110                 115                 120

atc ang tac ttc aat gat aaa acc att gat gag gaa cta gaa cgg gac 529
Ile Xaa Tyr Phe Asn Asp Lys Thr Ile Asp Glu Glu Leu Glu Arg Asp
125                 130                 135                 140

aag agg gtc act tgg att gtg gag ttc ttt gcc aan tgg tct aat gac 577
Lys Arg Val Thr Trp Ile Val Glu Phe Phe Ala Xaa Trp Ser Asn Asp
                145                 150                 155

tgc caa tca ttt gcc cct atc tat gct gac ctc tcc ctt aaa tac aac 625
Cys Gln Ser Phe Ala Pro Ile Tyr Ala Asp Leu Ser Leu Lys Tyr Asn
            160                 165                 170

tgt aca ggg cta aat ttt ggg aag gtg gat gtt gga cgc tat act gat 673
Cys Thr Gly Leu Asn Phe Gly Lys Val Asp Val Gly Arg Tyr Thr Asp
        175                 180                 185

gtt agt acg cgg tac aaa gtg agc aca tca ccc ctc acc aag caa ctc 721
Val Ser Thr Arg Tyr Lys Val Ser Thr Ser Pro Leu Thr Lys Gln Leu
    190                 195                 200

cct acc ctg atc ctg ttc caa ggt ggc aag gag gca atg cgg cgg cca 769
Pro Thr Leu Ile Leu Phe Gln Gly Gly Lys Glu Ala Met Arg Arg Pro
205                 210                 215                 220

cag at 774
Gln



<210> 287 
<211> 614
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155. .613 

<221> sig_peptide

<222> 155. .205 

<223> Von Heijne matrix 

score 5.80000019073486 

seq LWLKLLAFGFAFL/DT 

<400> 287 

aaaaaccgaa tctgacatca tcacctagca gttcatgcag ctagcaagtg gtttgttctt 60 
agggtaacag aggaggaaat tgttcctcgt ctgataagac aacagtggag aaaggacgca 120 
tgctgtttct tagggacacg gctgacttcc agat atg acc atg tat ttg tgg ctt 175 

Met Thr Met Tyr Leu Trp Leu 

-15 



aaa ctc ttg gca ttt ggc ttt gcc ttt ctg gac aca gaa gta ttt gtg 223
Lys Leu Leu Ala Phe Gly Phe Ala Phe Leu Asp Thr Glu Val Phe Val
-10                 -5                  1               5

aca ggg caa agc cca aca cct tcc ccc act gga ttg act aca gca aag 271
Thr Gly Gln Ser Pro Thr Pro Ser Pro Thr Gly Leu Thr Thr Ala Lys
            10                  15                  20

atg ccc agt gtt cca ctt tca agt gac ccc tta cct act cac acc act 319
Met Pro Ser Val Pro Leu Ser Ser Asp Pro Leu Pro Thr His Thr Thr
        25                  30                  35

gca ttc tca ccc gca agc acc ttt gaa aga gaa aat gac ttc tca gag 367
Ala Phe Ser Pro Ala Ser Thr Phe Glu Arg Glu Asn Asp Phe Ser Glu
    40                  45                  50

acc aca act tct ctt agt cca gac aat act tcc acc caa gta tcc ccg 415
Thr Thr Thr Ser Leu Ser Pro Asp Asn Thr Ser Thr Gln Val Ser Pro
55                  60                  65                  70

gac tct ttg gat aat gct agt gct ttt ark acc aca ggt gtt tca tca 463
Asp Ser Leu Asp Asn Ala Ser Ala Phe Xaa Thr Thr Gly Val Ser Ser
                75                  80                  85

gta cag acg cct cas ctt ccc acg cac gca gac tcg cag acg ccc tct 511
Val Gln Thr Pro Xaa Leu Pro Thr His Ala Asp Ser Gln Thr Pro Ser
            90                  95                  100

gct gga act gac acg cag aca ttc agc ggc tcc gcg sca atg caa aac 559
Ala Gly Thr Asp Thr Gln Thr Phe Ser Gly Ser Ala Xaa Met Gln Asn
        105                 110                 115

tca acc cta ccc cag gca gca atg cta tct cag atg tcc cag gag aga 607
Ser Thr Leu Pro Gln Ala Ala Met Leu Ser Gln Met Ser Gln Glu Arg
    120                 125                 130

gga gta c 614
Gly Val
135

<210> 288 
<211> 251 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 124 . .249 

<221> sig_peptide 

<222> 124 . .174 

<223> Von Heijne matrix 
score 5.80000019073486
seq LWLKLLAFGFAFL/DT 

<400> 288 

atttttattt acttttacat ttttgattcg tttttacaga gaaaaacttc tacagagata 60 

acaattattt tgcttttcag aaggacgcat gctgtttctt agggacacgg ctgacttcca 120 

gat atg acc atg tat ttg tgg ctt aaa ctc ttg gca ttt ggc ttt gcc 168
    Met Thr Met Tyr Leu Trp Leu Lys Leu Leu Ala Phe Gly Phe Ala
            -15                 -10                 -5

ttt ctg gac aca gaa gta ttt gtg aca ggg caa agc cca aca cct tcc 216
Phe Leu Asp Thr Glu Val Phe Val Thr Gly Gln Ser Pro Thr Pro Ser
        1               5                   10

ccc act ggt gtt tca tca gta cag acg ccc cag gg 251
Pro Thr Gly Val Ser Ser Val Gln Thr Pro Gln
15                  20                  25

<210> 289 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155. .415 

<221> sig_peptide

<222> 155..205

<223> Von Heijne matrix

score 5.80000019073486

seq LWLKLLAFGFAFL/DT

<400> 289 

aaaaaccgaa tctgacatca tcacctagca gttcatgcag ctagcaagtg gtttgttctt 60
agggtaacag aggaggaaat tgttcctcgt ctgataagac aacagtggag aaaggacgca 120
tgctgtttct tagggacacg gctgacttcc agat atg acc atg tat ttg tgg ctt 175
                                      Met Thr Met Tyr Leu Trp Leu
                                              -15

aaa ctc ttg gca ttt ggc ttt gcc ttt ctg gac aca gaa gta ttt gtg 223
Lys Leu Leu Ala Phe Gly Phe Ala Phe Leu Asp Thr Glu Val Phe Val
-10                 -5                  1               5

aca ggg caa agc cca aca cct tcc ccc act ggt gtt tca tca gta cag 271
Thr Gly Gln Ser Pro Thr Pro Ser Pro Thr Gly Val Ser Ser Val Gln
            10                  15                  20

acg cct cac ctt ccc acg cac gca gac tcg cag acg ccc tct gct gga 319
Thr Pro His Leu Pro Thr His Ala Asp Ser Gln Thr Pro Ser Ala Gly
        25                  30                  35

act gac acg cag aca ttc agc ggc tcc gcg sca atg caa aac tca acc 367
Thr Asp Thr Gln Thr Phe Ser Gly Ser Ala Xaa Met Gln Asn Ser Thr
    40                  45                  50

cta ccc cag gca gca atg cta tct cag atg tcc cag gag aga gga gta c 416
Leu Pro Gln Ala Ala Met Leu Ser Gln Met Ser Gln Glu Arg Gly Val
55                  60                  65                  70

<210> 290 
<211> 309 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 183. .308 



<221> sig_peptide 
<222> 183..290

<223> Von Heijne matrix 

score 5.80000019073486 

seq LFLLSGTIWIAIC/KP 

<400> 290 

gaggcctttt ggtcacttag aagaggtgcc aaagatcaag gagaggaaag tggtgggcta 60 
caaatgtaaa ttctgtgtgg aagtgcaccc aacgctccga gccatctgca atcacctccg 120 
wwagcacgtc cagtatggca atgtcccagc tgtgtcagct gctgtgaagg ggctgcgttc 180 
tc atg aga gga gcc acc tgg ccc tgg cca tgt tta ccc gcg agg aca 227 
Met Arg Gly Ala Thr Trp Pro Trp Pro Cys Leu Pro Ala Arg Thr 
-35 -30 -25 

agt aca gct gcc agt att gct cgt ttg ttt ctg ctt tca ggc aca att 275
Ser Thr Ala Ala Ser Ile Ala Arg Leu Phe Leu Leu Ser Gly Thr Ile
    -20                 -15                 -10

tgg atc gcc ata tgc aaa ccc acc acg aac ggg g 309
Trp Ile Ala Ile Cys Lys Pro Thr Thr Asn Gly
-5                  1               5

<210> 291 
<211> 359 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 142 . .357 

<221> sig_peptide

<222> 142. .333 

<223> Von Heijne matrix 

score 5.80000019073486 

seq SALLRSLLLPXLX/QI 

<221> misc_feature 
<222> 282 . .283 
<223> n=a, g, c or t 



<400> 291 

caagtccaca gcgctgggtg cagtacctcc ggcttcttca ggagtccatc tggcctggtg 60
gagttttgcc taagtttcca cggcccsgta aggacccaag agcagaaact ggctgctgag 120
aaacaggctt tgcagagcct g atg gga gtc ctc cca gat ctc gta gta gaa 171
                        Met Gly Val Leu Pro Asp Leu Val Val Glu
                                        -60                 -55
att ttt ggg gtg aac aaa tgc cgg ctg agc tgg ggt cta gtc ctg gag 219
Ile Phe Gly Val Asn Lys Cys Arg Leu Ser Trp Gly Leu Val Leu Glu
                -50                 -45                 -40

tca cta caa caa ccc ctc atc aac agg cat ttg att tac tgc ctt ggg 267
Ser Leu Gln Gln Pro Leu Ile Asn Arg His Leu Ile Tyr Cys Leu Gly
            -35                 -30                 -25

gac atc atc ctg grn ntc ttg gat ctc agt gct ctg ttg agg agt ctg 315
Asp Ile Ile Leu Xaa Xaa Leu Asp Leu Ser Ala Leu Leu Arg Ser Leu
        -20                 -15                 -10

ctg cta cca sct ctg sct cag ata ccc cag gca act cta aga gg 359
Leu Leu Pro Xaa Leu Xaa Gln Ile Pro Gln Ala Thr Leu Arg
    -5                  1               5

<210> 292 
<211> 254 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS 
<222> 129. .254 



<221> sig_peptide 

<222> 129. .173 

<223> Von Heijne matrix 

score 5.80000019073486 
seq ALGFVLLAPRGWG/SL 



<400> 292 

gtttttcagc tcgccattca cttcgctgtg aagatggcgt cgggcagcgg gacaaaaaac 60 
ttggactttc gccgaaagtg ggatgtggga aggtgggcag ggaccagatc aaaggagaca 120 
gccaggag atg aca gca ctg ggg ttt gtt ctg tta gct cca cgt ggc tgg 170
         Met Thr Ala Leu Gly Phe Val Leu Leu Ala Pro Arg Gly Trp
         -15                 -10                 -5

ggg agc ctc aca gtc atg gtg gaa ggc aag gaa gag caa gtc acg tct 218
Gly Ser Leu Thr Val Met Val Glu Gly Lys Glu Glu Gln Val Thr Ser
    1               5                   10                  15

tac acg gat ggc agc agg caa aga gac agc aat ttt 254
Tyr Thr Asp Gly Ser Arg Gln Arg Asp Ser Asn Phe
                20                  25



<210> 293 
<211> 414 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 221. .412 



<221> sig_peptide

<222> 221. .337 

<223> Von Heijne matrix 

score 5.80000019073486 
seq LISFLHTLQVVCS/VI



<400> 293 

gtcagagcac atccggtgtt agaagcgctg gtaggccttg gagaggcggg ttaggaagag 60 
tggagactgc tgcacggact ctggaaccat gaacatattt gatcgaaaga tcaactttga 120 
tgcgctttta aaattttctc atataacccc gtcaacgcag cagsrcctga agaagatttc 180
attactgtct tcagaaaact catgatgatc ctggccatga atg aaa agg ata aga 235
                                            Met Lys Arg Ile Arg
                                                            -35

aga aag aga aga aat gaa gtg acc atc cag cct ttc cca att aga ctt 283
Arg Lys Arg Arg Asn Glu Val Thr Ile Gln Pro Phe Pro Ile Arg Leu
                -30                 -25                 -20

cct ctc ctt cca ccc ctc att tcc ttt ttg cac aca tta cag gtg gtg 331
Pro Leu Leu Pro Pro Leu Ile Ser Phe Leu His Thr Leu Gln Val Val
            -15                 -10                 -5

tgt tct gtg ata atg aaa agc atc aga aaa gct ttt gta ctt tgt ggt 379
Cys Ser Val Ile Met Lys Ser Ile Arg Lys Ala Phe Val Leu Cys Gly
        1               5                   10

ttc ctc tat ttt gaa ttt ttt gat caa aaa ctg at 414
Phe Leu Tyr Phe Glu Phe Phe Asp Gln Lys Leu
15                  20                  25




<210> 294 
<211> 334 
<212> DNA 
<213> Homo sapiens 
<220>
<221> CDS
<222> 156..332

<221> sig_peptide

<222> 156. .221 

<223> Von Heijne matrix 

score 5.80000019073486 
seq XVXLFXXVXVXXA/AL



<400> 294 

catgtgtttc tcatctttca accaccaaac ttacataaaa ttcatccacc ttcccattct 60 
tcctattata gtacaggaaa ggtccttcct gtcaaaggca aaatcactta tgattgtgtc 120 
cccatctctc ttgccttttc aaggactttg agcct atg ctg cca ctg ctt cat 173 

Met Leu Pro Leu Leu His 

-20 

tgt ttt ttt ttk gtt kgt ttg ttt kgt ttk gtt ttk gtt twa ama gca 221
Cys Phe Phe Xaa Val Xaa Leu Phe Xaa Xaa Val Xaa Val Xaa Xaa Ala
    -15                 -10                 -5

gct tta ttg aga tat aat yca agt ata cag kgt ggc cgg gca cag kgg 269
Ala Leu Leu Arg Tyr Asn Xaa Ser Ile Gln Xaa Gly Arg Ala Gln Xaa
1               5                   10                  15

ctc ama act gwa atc cca gma ctt tgg gag act aag gma ggc aga tta 317
Leu Xaa Pro Xaa Ile Pro Xaa Leu Trp Glu Thr Lys Xaa Gly Arg Leu
            20                  25                  30

ctt gag cct agg aat tt 334
Leu Glu Pro Arg Asn
        35



<210> 295 

<211> 375 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 284 . .373 



<221> sig_peptide 

<222> 284 . .346 

<223> Von Heijne matrix 

score 5.80000019073486 
seq EMCLSQLFLQVEA/AY 



<400> 295 

ataacgctac ctgccgagca agccgagtgg agggaggcaa aggccaggct gaggatcagg 60
gtggcccggg tggcagcggg gaggcgctgc atgctggagg ctgtgctgag tgcccggtgc 120
aggtgagccg gtcctgcgga gttgtgccga gtgcctgctg cagtctcatt tccagttcct 180
ccatgacgtg gcaagtgaag acaggaatga aaggratgta aagcagcttt tctctgaaga 240
gaagaagaga gagagacaca gccaagaccg aggctgggcc aag atg gtg tct gtg 295
                                                Met Val Ser Val
                                                    -20

ttt cga agc gag gag atg tgt ttg tca caa ctg ttt ctc cag gtg gaa 343
Phe Arg Ser Glu Glu Met Cys Leu Ser Gln Leu Phe Leu Gln Val Glu
        -15                 -10                 -5

gct gca tat tgc tgt gtg gct gag ctc gga ga 375
Ala Ala Tyr Cys Cys Val Ala Glu Leu Gly
    1               5



<210> 296 
<211> 226 
<212> DNA 

<213> Homo sapiens 
<220>
<221> CDS
<222> 104 . .226 

<221> sig_peptide 

<222> 104 . .187 

<223> Von Heijne matrix 

score 5.80000019073486 

seq WWSVASLLSDVAA/WW 

<400> 296 

tggattgaaa taaattccta gctccacggt caggtcagta ggctgccatg atgaaatttg 60 
aagaagagtc tgttatgatg tgtaatacca atttctggag ggc atg gct gct ctc 115
                                                Met Ala Ala Leu
                                                            -25

cga agt act cta aca tgg aca gaa gtc gtg ggc tgg tgg agt gtt gcg 163
Arg Ser Thr Leu Thr Trp Thr Glu Val Val Gly Trp Trp Ser Val Ala
                -20                 -15                 -10

tcg ctg ctt agt gat gtg gca gca tgg tgg cca ccg cac tcc acc tca 211
Ser Leu Leu Ser Asp Val Ala Ala Trp Trp Pro Pro His Ser Thr Ser
            -5                  1               5

aca cgg gga ggg gta 226
Thr Arg Gly Gly Val
    10

<210> 297 
<211> 232 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 90..230

<221> sig_peptide

<222> 90. .221 

<223> Von Heijne matrix 

score 5.80000019073486 

seq LVCVFTCSLLAFF/SP 

<400> 297 

ctgaactttt tatttttcta tttttataac caggagaaag taaacacata cacacacatg 60
gatggagaga gggacagagg gatggacgg atg aat gca ttr gta gat ggg aaa 113
                                Met Asn Ala Leu Val Asp Gly Lys
                                                -40

cgg ctt asa krg tgc ata cgc tat ttc gat tct atc tca cta tat tct 161
Arg Leu Xaa Xaa Cys Ile Arg Tyr Phe Asp Ser Ile Ser Leu Tyr Ser
    -35                 -30                 -25

aag gca agt tta agt tgt tgt tta gtg tgt gtg ttt act tgt tca ttg 209
Lys Ala Ser Leu Ser Cys Cys Leu Val Cys Val Phe Thr Cys Ser Leu
-20                 -15                 -10                 -5

cta gct ttc ttc agc cca tgc ac 232
Leu Ala Phe Phe Ser Pro Cys
                1

<210> 298 
<211> 258 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<221> sig_peptide
<222> 7. .63 

<223> Von Heijne matrix 

score 5.80000019073486 
seq WVFLVAILKGVQC/EL 

<400> 298 

ccaacc atg gag ttt ggg ctt agc tgg gtt ttc ctt gtt gct att ttg 48
Met Glu Phe Gly Leu Ser Trp Val Phe Leu Val Ala Ile Leu
-15 -10

aaa ggt gtc caa tgt gaa ctg cag gtg gtg gag tct ggg gga ggc ttg 96
Lys Gly Val Gln Cys Glu Leu Gln Val Val Glu Ser Gly Gly Gly Leu
-5 1 5 10

gta cag cca ggg cgg tcc ctc aga ctc tcc tgt cga act tct gga ttc 144
Val Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Arg Thr Ser Gly Phe
15 20 25

gcc ttt gat gat tat aat ttg agt tgg gtc cgc cag gct cca ggg aag 192
Ala Phe Asp Asp Tyr Asn Leu Ser Trp Val Arg Gln Ala Pro Gly Lys
30 35 40

ggg ctg gag tgg gta ggt ttc att aga agc aaa cct tat ggt gag aca 240
Gly Leu Glu Trp Val Gly Phe Ile Arg Ser Lys Pro Tyr Gly Glu Thr
45 50 55

aca acg tac gcc gcg tgg 258
Thr Thr Tyr Ala Ala Trp
60 65


<210> 299 
<211> 139 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 83 . . 139 

<221> sig_peptide 

<222> 83. .124 

<223> Von Heijne matrix 

score 5.80000019073486 

seq SLFXLXXLRQSFT/XX 

<400> 299 

tttgggagct ccagtgttag gcgcatatat atttyagaat tgtgacaatt tcctgttggt 60
ttagtcctyt tatcattata ta atg tcc ctc ttt gwc ctt yyt yyt ttg aga 112

Met Ser Leu Phe Xaa Leu Xaa Xaa Leu Arg
-10 -5

cag agt ttc act cht gwt gcc cag gca 139
Gln Ser Phe Thr Xaa Xaa Ala Gln Ala
1 5

<210> 300
<211> 286
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 196. .285

<221> sig_peptide
<222> 196. .252

<223> Von Heijne matrix

score 5.80000019073486 
seq SFYFLASSSLSTS/AS

<221> misc_feature 

<222> 16,286 

<223> n=a, g, c or t



<400> 300 

asatcgcgct gggganasgc cacgtcgcta tgagtgtgtt tcagtctacc tggattaaac 60 
gtttgcttct cttcgtctac cttgattaaa cgtgcacttc gcagtcctcg gttctccata 120 
cccgtgacct ggggatcgct acggacctta aaatacccgc aacascccct tcgtsccaag 180 
ctggagagca gtggc atg atc tcg gct cac tgc agc ttc tac ttc ctg gcc 231
Met Ile Ser Ala His Cys Ser Phe Tyr Phe Leu Ala
-15 -10
tca agc agt ctt tcc acc tca gcs tct saa cgc act gga att aca gat 279
Ser Ser Ser Leu Ser Thr Ser Ala Ser Xaa Arg Thr Gly Ile Thr Asp

-5 1 5
gtg agc n 286

Val Ser
10

<210> 301 
<211> 242 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 113. .241 

<221> sig_peptide 

<222> 113 . .184 

<223> Von Heijne matrix 

score 5.69999980926514 

seq CLFVFLYTTPCNC/FG 

<400> 301 

tgcaatgggt ggtgttttat aatgctctct tccctaatca tgttaatcag gagattttcc 60
ttttggaact cctgactgaa agcttcttag tttacacaca tgttcctcca gg atg aac 118

Met Asn

gct gaa aat aac ttt ttc ggt ttt gtt tgt ttg ttt gtt ttc ctc tat 166
Ala Glu Asn Asn Phe Phe Gly Phe Val Cys Leu Phe Val Phe Leu Tyr

-20 -15 -10

aca acc cct tgc aat tgc ttt ggt tta gaa cac ctt tgg att cta agt 214
Thr Thr Pro Cys Asn Cys Phe Gly Leu Glu His Leu Trp Ile Leu Ser

-5 1 5 10

ttc atg gtt gtt ctg gga gwy acc agg g 242
Phe Met Val Val Leu Gly Xaa Thr Arg 
15 

<210> 302 
<211> 136 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 42. .134 



<221> sig_peptide 
<222> 42 . .110 
<223> Von Heijne matrix

score 5.69999980926514 
seq LPCCCHLLTCVSS/LR 

<400> 302 

agtcacagtg acacagcctt ccaaccaggc cgccccctgg c atg acc atg get gtg 56 

Met Thr Met Ala Val 
-20 

ggt gca gct gmy cam ctc ccc tgc tgc tgc cat ttr ctc acc tgc gtm 104
Gly Ala Ala Xaa Xaa Leu Pro Cys Cys Cys His Leu Leu Thr Cys Val

-15 -10 -5

tcc agc ctt cgc amt gac att tac cca cat gg 136
Ser Ser Leu Arg Xaa Asp Ile Tyr Pro His
1 5 

<210> 303 
<211> 175 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 73. .174 

<221> sig_peptide 

<222> 73..147

<223> Von Heijne matrix 

score 5.69999980926514 

seq SILLAALSRNISP/GQ

<400> 303 

aagaggaagc ggaakdgcct caggtgggcg gtagtgccaa aagcccaggg cgtccgcgca 60
aaccgaggcg tc atg cgg aga aaa agg cga gaa aga aaa gag agg aag agc 111
Met Arg Arg Lys Arg Arg Glu Arg Lys Glu Arg Lys Ser
-25 -20 -15

atc ctc ctg gcc gcc ctt tcg agg aac ata agt cct ggt cag aca tac 159
Ile Leu Leu Ala Ala Leu Ser Arg Asn Ile Ser Pro Gly Gln Thr Tyr

-10 -5 1

cga aca tcc ccc gcg g 175
Arg Thr Ser Pro Ala 
5 

<210> 304 
<211> 493 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 402. .491 

<221> sig_peptide 

<222> 402. .470 

<223> Von Heijne matrix 

score 5.69999980926514 

seq LELLTSSDPPSLA/SQ 

<400> 304 

ttaggtgttc tgatagttaa gtggtagtat catggtctta atttttcctt gaagtggctt 60 

ttgatttgca tttccttaat gactaattag gttgagcatc ttttcatgta cttactggcc 120

ttctttggag aaataccttt tccaaatcca atgggttgtc tttttttatt gttgatctta 180

agggttctta ggtgttctgg gtaccagttt cttgtgagat gtgtgacttg taaatacttt 240

cttccattct ccatgttgtc tttttattct cttgatggta ttctttgaaa tacaaaartk 300

tttatatttg acaaagttca gtttatttat ttatttattg ccattcgtgc ttttggtttt 360
gataatccat ttttwttgtt tttattttta tttacttaga g atg ggg tct ccc tat 416 

Met Gly Ser Pro Tyr 
-20 

gtt gcc cac gtt ggt ctt gaa ctc ttg acc tca agt gat cct ccc tcc 464
Val Ala His Val Gly Leu Glu Leu Leu Thr Ser Ser Asp Pro Pro Ser

-15 -10 -5

ttg gcc tcc caa gtg ctg gga ata cat tm 493
Leu Ala Ser Gln Val Leu Gly Ile His
1 5 

<210> 305 
<211> 214 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 79. .213 

<221> sig_peptide
<222> 79. .135 
<223> Von Heijne matrix 

score 5.69999980926514 

seq VCWLTLTLAHSHS/LT 

<400> 305 

cacacacgca ccaaatacac acagasaccc tggccctcac tcacgcacav tctctcacac 60
tcgtggacac acccccag atg cat ctt tac act cat gta tgc tgg ctc act 111

Met His Leu Tyr Thr His Val Cys Trp Leu Thr
-15 -10
ctc aca ctg gca cac tca cac agc ttg acc cac acg cac aca ctc aca 159
Leu Thr Leu Ala His Ser His Ser Leu Thr His Thr His Thr Leu Thr

-5 1 5
ccc agt cac aca cgt aca cac tca cat acg tgt gct tgc cta cac gca 207
Pro Ser His Thr Arg Thr His Ser His Thr Cys Ala Cys Leu His Ala

10 15 20

cac aag g 214
His Lys
25

<210> 306
<211> 458
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 306. .458

<221> sig_peptide
<222> 306. .350
<223> Von Heijne matrix

score 5.69999980926514

seq LSLTFYHFPLCWG/HQ

<221> misc_feature
<222> 286,448
<223> n=a, g, c or t


<400> 306 

atcagagagc gccggaagcg gtccgagaat gaagcagtgt gatctaccat gcattgtctc 60
agaaagaggc gaatgactcc gatgtccagg tcagttcttg gcagggagtc caggagcaac 120
agaggtgatg gcaaagatgg ctcagtagct tctgagcccc cagcactgat tgagatgtcc 180
tttcccacat catactcctc atttttctgg cagacatcta aggctggatc aaagtctgta 240
gttctcatta cctgttccca cgtgccagcc tccttttctg ttgtgnmmaa gtcaagtttg 300
gtaaa atg agg ctt tcc tta acc ttt tat cat ttc cca ctg tgt tgg gga 350

Met Arg Leu Ser Leu Thr Phe Tyr His Phe Pro Leu Cys Trp Gly

-15 -10 -5

cac cag gct gtg ccc acg tgg tgg saa rgc atc att caa cct tgt cac 398
His Gln Ala Val Pro Thr Trp Trp Xaa Xaa Ile Ile Gln Pro Cys His
1 5 10 15

tgt gcc ctc tgc act tct gca gaa ggt gtg caa tca cat atc ata agt 446

Cys Ala Leu Cys Thr Ser Ala Glu Gly Val Gln Ser His Ile Ile Ser

20 25 30

gna att tac aga 458
Xaa Ile Tyr Arg
35



<210> 307 
<211> 328 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 87 . .326 



<221> sig_peptide

<222> 87. .128 

<223> Von Heijne matrix 

score 5.69999980926514 
seq NVLIIVFVAFAFG/FL 



<400> 307 

tatcttctct ccagtctaaa gcctcactga acaaactgtc cttgactgtc agtgctcagg 60 
gaactgctct gccacccttc tcctca atg aat gtg tta atc att gtt ttt gtt 113

Met Asn Val Leu Ile Ile Val Phe Val
-10

gca ttt gct ttt ggg ttc ytg gtc atg aag tct ttg ctt aag cca atg 161
Ala Phe Ala Phe Gly Phe Leu Val Met Lys Ser Leu Leu Lys Pro Met
-5 1 5 10

tcg aga agg gtt ttt ctg atg tta tct tct agg att ttt atg gtt tca 209
Ser Arg Arg Val Phe Leu Met Leu Ser Ser Arg Ile Phe Met Val Ser
15 20 25

ggt ctt aga ttt aag tcc ttg atc cat ctt gag ttg att ttt gta tat 257
Gly Leu Arg Phe Lys Ser Leu Ile His Leu Glu Leu Ile Phe Val Tyr
30 35 40

aag ttg aga gat gag gat cca gtt tca ttc ttc tac atg tgg ctt gcc 305
Lys Leu Arg Asp Glu Asp Pro Val Ser Phe Phe Tyr Met Trp Leu Ala
45 50 55

aat tat ccc agc acc att tgt tg 328
Asn Tyr Pro Ser Thr Ile Cys
60 65


<210> 308 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 33. .380 



<221> sig_peptide



WO 99/53051 PCT/IB99/00712

<222> 33. .92 

<223> Von Heijne matrix 

score 5.69999980926514 

seq LAWALPSLLRLGA/AQ

<221> misc_feature
<222> 326 

<223> n=a, g, c or t 



<400> 308 



agcggtctcc cggcccctgc cgccctgcca ct atg tcc cgc cgc tct atg ctg 53
Met Ser Arg Arg Ser Met Leu
-20 -15

ctt gcc tgg gct ctc ccc agc ctc ctt cga ctc gga gcg gct cag gag 101
Leu Ala Trp Ala Leu Pro Ser Leu Leu Arg Leu Gly Ala Ala Gln Glu
-10 -5 1

aca gaa gac ccg gcc tgc tgc agc ccc ata gtg ccc cgg aac gag tgg 149
Thr Glu Asp Pro Ala Cys Cys Ser Pro Ile Val Pro Arg Asn Glu Trp
5 10 15

aag gcc ctg gca tca gag tgc gcc cag cac ctg agc ctg ccc tta cgc 197
Lys Ala Leu Ala Ser Glu Cys Ala Gln His Leu Ser Leu Pro Leu Arg
20 25 30 35

tat gtg gtg gta tcg cac acg gcg ggc agc agc tgc aac acc scc gcc 245
Tyr Val Val Val Ser His Thr Ala Gly Ser Ser Cys Asn Thr Xaa Ala
40 45 50

tcg tgc cag cag cag gcc cgg aat gtg cag cac tac cac atg aag aca 293
Ser Cys Gln Gln Gln Ala Arg Asn Val Gln His Tyr His Met Lys Thr
55 60 65

ctg ggc tgg tgc gac gtg ggc tac aac tkc ctn gat tgg aga aga cgg 341
Leu Gly Trp Cys Asp Val Gly Tyr Asn Xaa Leu Asp Trp Arg Arg Arg
70 75 80

gct cgt ata cra ggg ccg tgg mtg gaa ctt cac ggg tsc 380
Ala Arg Ile Xaa Gly Pro Trp Xaa Glu Leu His Gly Xaa
85 90 95



<210> 309 
<211> 284 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 228. .284 

<221> sig_peptide 

<222> 228..269

<223> Von Heijne matrix 

score 5.69999980926514 

seq VFLFLMISVFAGC/QI 

<400> 309 

aaaagaagaa agctgaatca tactcgatga ttattgatca tttgtataca gctcaagccc 60
tcaagtagcc tgctgtaata tttactagtt acaaagaaaa gattcgtttt gtcacagtta 120
catgaaaggt gcttatattt gcaaatatgg agacaaagtt catcttaaaa gattaaaatg 180
agaatctcct aaatgaagca tttggaatat tgattagtat accagaa atg gtt ttt 236

Met Val Phe

ctt ttt ctt atg atc agc gtt ttt gcc ggt tgt caa atc cct tcc ggg 284
Leu Phe Leu Met Ile Ser Val Phe Ala Gly Cys Gln Ile Pro Ser Gly
-10 -5 1 5



<210> 310 
<211> 357 




<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 243 . .356 



<221> sig_peptide 

<222> 243. .305 

<223> Von Heijne matrix 

score 5.59999990463257 
seq AGLELLASSNSSA/LP 



<400> 310 

ttgagatcac ctgaggcaac atagtgaaac cctgtatcta gaataaatta gagaaagaaa 60 
aatagtctgg gcatgatggt gtgcacctat agtctccagc tabtcasgag cctgaggcag 120 
gaggwtcact tgagctkagg agttcaagga tgcagtsacc tgtgattgca ccactgcatt 180 
ccagcttgga caacagagtg agaccctgtc ttaaaattta aattttktgt yttwtggtag 240
ag atg ggg tct cgc cct gtt tcc gak gct ggt ctc gaa ctc ctg gcc 287
Met Gly Ser Arg Pro Val Ser Xaa Ala Gly Leu Glu Leu Leu Ala
-20 -15 -10

tcg agc aat tct tct gcc ttg ccc ttc caa tgt tct ggg att aca ggc 335
Ser Ser Asn Ser Ser Ala Leu Pro Phe Gln Cys Ser Gly Ile Thr Gly

-5 1 5 10

atg agc crc cac acc cta gcg g 357
Met Ser Xaa His Thr Leu Ala
15 



<210> 311 
<211> 470 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 413 . .469 



<221> sig_peptide

<222> 413 . .451 

<223> Von Heijne matrix 

score 5.59999990463257 
seq MLCHLSLVFLGXG/QF 

<221> misc_feature 
<222> 30 

<223> n=a, g, c or t 



<400> 311 

ccgttaacgg gattctggaa tttgttaggn taattgcttt tcaatatcaa gagatctggc 60 
aatcaaattt aataatatca agcttgcttg gtgagcatgg atttataaga tagaatggtt 120 
tgtgggggrg artatagtka cgaaaaagrk tattgtttcc cataatgcct ggtattgtat 180
taagtacttt gcatacagta gggcatttca ttgtcccagt gatcctcctg caaagtaggt 240
acaattatct tcaatttaca aatgaggaaa ccaagctctc ttcaagctga taagatgctg 300
aactgagatt tgaaccaagt ccctctgccc ctaagagccc ctacccctag ctgctactat 360
atgctgtacc catctaagct ttgtgaaata rccttgttcc actgcagaga ag atg ttg 418

Met Leu

tgt cac cta tct cta gta ttt ctt ggc ktt ggg cag ttc tgg agt caa 466
Cys His Leu Ser Leu Val Phe Leu Gly Xaa Gly Gln Phe Trp Ser Gln

-10 -5 1 5

aat g 470 
Asn 




<210> 312 

<211> 187 

<212> DNA

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 98. .187 

<221> sig_peptide 
<222> 98. .148 
<223> Von Heijne matrix 

score 5.59999990463257 

seq FMCLFAICISSNA/KC 

<400> 312 

aagtttgttt ttgttggtgg tggtgatagt ttcttatatt ctattccata aagtatgaaa 60 
tggaggctcc ttgtgatttt taatttgcac ttctgta atg act aat ctt ttc atg 115 

Met Thr Asn Leu Phe Met 
-15 

tgc ttg ttt gcc atc tgt ata tct tct aat gcg aag tgt ctg ttt agt 163
Cys Leu Phe Ala Ile Cys Ile Ser Ser Asn Ala Lys Cys Leu Phe Ser

-10 -5 1 5

ctt ttt cct ttt ttt att gag ggg 187
Leu Phe Pro Phe Phe Ile Glu Gly
10 

<210> 313 
<211> 237 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .236 

<221> sig_peptide 

<222> 93..173

<223> Von Heijne matrix

score 5.59999990463257

seq CVLFTLLVSTRSG/RS

<221> misc_feature

<222> 111

<223> n=a, g, c or t






<400> 313 

ttgcttagga ttttctaaaa gattacataa aatactgttg aaaagatgat tgcatacaaa 60
acataatctg ttcattgtta aacgtatacg aa atg ttg gga tac atc tgg naa 113

Met Leu Gly Tyr Ile Trp Xaa

-25

caa gac aaa gtc ttt gct aat tgt gtt cta ttt acg ctc tta gtg tct 161

Gln Asp Lys Val Phe Ala Asn Cys Val Leu Phe Thr Leu Leu Val Ser

-20 -15 -10 -5

aca aga tcc ggg aga tcg cgs gcg ggt tgt gcc tgg agg tgg agg gga 209

Thr Arg Ser Gly Arg Ser Arg Ala Gly Cys Ala Trp Arg Trp Arg Gly

1 5 10
aga tgg tca gta gga cag aag ggc hgg g 237
Arg Trp Ser Val Gly Gln Lys Gly Xaa
15 20



<210> 314 




<211> 356 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 272. .355 

<221> sig_peptide

<222> 272. .316 

<223> Von Heijne matrix 

score 5.59999990463257 

seq LILSLQVCRPATL/DQ 

<221> misc_feature 
<222> 275. .276 
<223> n=a, g, c or t 



<400> 314 

ggatttgctt tctttttctc caaaagggga ggaaattgaa actgagtggc ccacgatggg 60 
aagaggggaa agcccagggg tacaggaggc ctctgggtga aggcagaggc taacatgggg 120 
ttcggagcga ccttggccgt tggcctgacc atctttgtgc tgtctgtcgt cactatcatc 180 
atctgcttca cctgctcctg ctgctgcctt tacaagacgt gccgccgacc acgtccggtt 240 
gtcaccacca ccacatccac cactgtggtg c atg nnc ctt atc ctc agc ctc 292

Met Xaa Leu Ile Leu Ser Leu
-15 -10
caa gtg tgc cgc cca gct acc ctg gac caa gct acc agg gct acc aca 340
Gln Val Cys Arg Pro Ala Thr Leu Asp Gln Ala Thr Arg Ala Thr Thr

-5 1 5
cca tgc cgc cta cgg g 356
Pro Cys Arg Leu Arg 
10 



<210> 315 
<211> 162 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 40. .162 



<221> sig_peptide

<222> 40. .150 

<223> Von Heijne matrix 

score 5.59999990463257 
seq VLMLSLPLPPTPQ/QA 



<400> 315 

tacatgtgca gaatgtgcag atttgtcaca taggtgtgt atg tgc cac agg cgt 54 

Met Cys His Arg Arg 
-35 

tgg ctg cac cta tca acc cgt cat cta ggt ttt aag ccc cgc atc cat 102
Trp Leu His Leu Ser Thr Arg His Leu Gly Phe Lys Pro Arg Ile His

-30 -25 -20

tac gta ttt gtc tta atg ctg tcc ctc ccc ttg ccc ccc acc ccc caa 150
Tyr Val Phe Val Leu Met Leu Ser Leu Pro Leu Pro Pro Thr Pro Gln

-15 -10 -5

cag gcc ctc ggg 162
Gln Ala Leu Gly
1 




<210> 316 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 297. .404 

<221> sig_peptide 

<222> 297. .353 

<223> Von Heijne matrix 

score 5.59999990463257 

seq FVIFPAALLLCWG/GL 

<400> 316 

taagctgaaa aagaatataa aaattaaaga gaaattgaaa atctaagtct tgcagtgaga 60 
atgaccagaa atcgtttccc tctctggggg gttcctgttt aatatgaaag tcctcttaac 120 
aagcgtggac agaggaagtt ttaggtttga tttgaacttc atgtacatga catatttcat 180 
ttttttttct tccctcacaa atttcaaccc aggccacttg tttgcagaga ctgccaaacc 240 
ttccattgct gcttccaaga tactcctgga atctgagatt accttttatc ctcttg atg 299 

Met 

gac cat gtt gtt att ttt gtc att ttc cct gca gct ctt ctg ctt tgc 347
Asp His Val Val Ile Phe Val Ile Phe Pro Ala Ala Leu Leu Leu Cys

-15 -10 -5

tgg gga gga ctc atc ccc cta tgc atc atc tac ccc ccg ata gct gac 395
Trp Gly Gly Leu Ile Pro Leu Cys Ile Ile Tyr Pro Pro Ile Ala Asp

1 5 10
aca gtt ggg 404

Thr Val Gly 
15 

<210> 317 
<211> 450 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 359. .448 

<221> sig_peptide

<222> 359. .433

<223> Von Heijne matrix

score 5.59999990463257

seq LIXILXFDIYSLA/FI

<221> misc_feature

<222> 323,410

<223> n=a, g, c or t



<400> 317 

tatgtctttt gaatttgtga tgtacatatt aacagtagat taagttgaaa taataaaatc 60 

tgtattgttt atgatttatc agttatatga tgagtagaat atagtctatt gtggscmagt 120 

gtgtatatat aacataaaca atacattaac ccaattttgt gtgaaaatta ttttgggacc 180

tagtagcttt cttggtcaca acctttcaaa caaacaaatt ttttttaaat taattttttc 240

ccttaataaa gaaaacaatt cctcaatgtg taatagcaaa taccttttaa caggtcatat 300

atcatcaatg ctttctttga aancgtactg atgcttacaa gatgctttac gagtaaag 358

atg ctt aca aat ctt ttc ttt caa gta gct cat cct ctg atc att att 406
Met Leu Thr Asn Leu Phe Phe Gln Val Ala His Pro Leu Ile Ile Ile
-25 -20 -15 -10

ctg ntg ttt gat atc tac tcc cta gca ttt atc cat gac gtg gg 450
Leu Xaa Phe Asp Ile Tyr Ser Leu Ala Phe Ile His Asp Val
-5 1 5



<210> 318 

<211> 395 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 313. .393 



<221> sig_peptide

<222> 313 . .354 

<223> Von Heijne matrix 

score 5.59999990463257 
seq LFGLRGMLPLTQQ/AP 



<400> 318 

aatcgaaaac agcaaatcac acaactgtta aaaatatttt vtgtttacaa agccaagcca 60
aaattttatg ttttcctccc caaactttga tataaacact aacatttttt agcatgtata 120
aacatcatta ttaaccagtg tcctattaaa actccttttc tatgatagaa tgtctgttrc 180
ttttaggtgg ataaggccta gatgattggc ctctaccagc atcctcatct ctgtccctga 240
tgcccagctt carcctcgct cctgyatgct ggaccgcttc agtghagctc tcagacttgc 300
tctgtgtctc ac atg cty ttt ggc tta cgt gga atg ctc cca ctc acc cag 351
Met Leu Phe Gly Leu Arg Gly Met Leu Pro Leu Thr Gln
-10 -5
caa gct ccc att cct cat tta aga tgt aaa ttg agt gtc acc tc 395
Gln Ala Pro Ile Pro His Leu Arg Cys Lys Leu Ser Val Thr
1 5 10



<210> 319 
<211> 257 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 20. .256 



<221> sig_peptide 

<222> 20. .82 

<223> Von Heijne matrix 

score 5.59999990463257 
seq ACVYACVCASVSA/CV 



<400> 319 

catctgtgtg tgcgtgtgt atg cgt gtg tgt atg cgt ctg tgt gca tgt gtg 52 

Met Arg Val Cys Met Arg Leu Cys Ala Cys Val 
-20 -15 



tat gcg tgt gtg tgc gca tca gtg tct gca tgt gtg tat rtg tgt gta 100
Tyr Ala Cys Val Cys Ala Ser Val Ser Ala Cys Val Tyr Xaa Cys Val
-10 -5 1 5

tgt atg tst gtg cgc gcg cat ctg tgt gtg tgc atg tgt gta tgt atg 148
Cys Met Xaa Val Arg Ala His Leu Cys Val Cys Met Cys Val Cys Met
10 15 20

tgt gtg cat ctc tgt gtg tgc atg tgt gta tgt gtg tgt gca tct gtg 196
Cys Val His Leu Cys Val Cys Met Cys Val Cys Val Cys Ala Ser Val
25 30 35

tgt gtg tgc atg tgt gca tgc gtg tgt atg tgt gtg tgc gtg cgt gca 244
Cys Val Cys Met Cys Ala Cys Val Cys Met Cys Val Cys Val Arg Ala
40 45 50

tct gtg tgt gtg c 257
Ser Val Cys Val
55

<210> 320 
<211> 325 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 256. .324 

<221> sig_peptide 

<222> 256. .318 

<223> Von Heijne matrix 

score 5.59999990463257 

seq LIANLVLFISIAA/LR 

<400> 320 

accacgcctc ctccaagtcc cagcgaaccc gcgtgcaacc tgtccctaaa aaagccaaag 60 
cagtcactct ttacctccca ctttccctcc tcccagcctt tggcaaccac taatctactt 120 
tccgtgtata tggatttacc tattcaggac atttcatatg tcctttggtg actggcttct 180 
ttcactttgc acaatgtttt taaggttcat tcctgtcata gtgtgtgtca gtacgaaccc 240
ctccttaacc atcta atg gtt atc acc tct aat agt tat ctc ata gcc aat 291
Met Val Ile Thr Ser Asn Ser Tyr Leu Ile Ala Asn
-20 -15 -10

ctt gtt tta ttt ata tct atc gcc gcc ctc cgg g 325
Leu Val Leu Phe Ile Ser Ile Ala Ala Leu Arg
-5 1 



<210> 321 
<211> 201 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31..201

<221> sig_peptide
<222> 31. .183
<223> Von Heijne matrix
score 5.5

seq LSLHASLVTKAFS/IN 



<400> 321 

catcacaaga acccagagtg gaattctggg atg gaa gag ctg gac aga aag tgg 54

Met Glu Glu Leu Asp Arg Lys Trp
-50 -45
aga gag aag gtc ctc cca gcg gca aag cta att aaa agg aga aac ctg 102
Arg Glu Lys Val Leu Pro Ala Ala Lys Leu Ile Lys Arg Arg Asn Leu

-40 -35 -30

ttt tcc aca tgc act cct caa tat ggy aca cat gct gct ttc ttg tca 150
Phe Ser Thr Cys Thr Pro Gln Tyr Gly Thr His Ala Ala Phe Leu Ser

-25 -20 -15

tta cat gcc tca ctt gtc acc aaa gca ttt tca atc aat tcc tgg gag 198
Leu His Ala Ser Leu Val Thr Lys Ala Phe Ser Ile Asn Ser Trp Glu

-10 -5 1 5

tgg 201 
Trp 



<210> 322 
<211> 159 



<212> DNA

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 77 . . 157 

<221> sig_peptide 
<222> 77. .151 
<223> Von Heijne matrix 
score 5.5 

seq PLLLCPLSSGSPC/PR 
<400> 322 

aacaaaggga cagaatggtc ccagggttcc ttcttcttcc ttccagttaa gagctcagag 60 
tggaagtggg ctgggg atg gtg tcg ggg gcc caa gct ccc agc tcc caa agg 112
Met Val Ser Gly Ala Gln Ala Pro Ser Ser Gln Arg

-25 -20 -15

ccc ctg ctt cta tgc cct ttg agc tca ggt agc ccc tgc ccc cgg gg 159
Pro Leu Leu Leu Cys Pro Leu Ser Ser Gly Ser Pro Cys Pro Arg
-10 -5 1 

<210> 323 
<211> 420 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 325. .420 

<221> sig_peptide 
<222> 325 . .405 
<223> Von Heijne matrix 
score 5.5 

seq SFLPSLLSSFLLS/LP 

<221> misc_feature 
<222> 117 

<223> n=a, g, c or t 



<400> 323 

catgcaggat agtaatacgt tagaatcaaa aataaggtta tacttagaaa atattgattt 60 
gcctttttga ttttgcatgt gtataatctg gctctgaaat cagtgacacg aagtganctt 120
cgaaacaagc ctgagcaata gaagtagatg tggaaataac ttcggtttct caaggcaaat 180
actttgatag gaacaaacaa ccgtttagat atagaagatg tgatacattc ctttaaaaag 240
aatttgacct tatgtcattg taggcacacc tcatatttca attattcata tagtttttct 300
tgagcaattg ctggtttaag aata atg tca tgt ctt ttg cgt gct tat atc 351

Met Ser Cys Leu Leu Arg Ala Tyr Ile
-25 -20
att tgg ata ttt cct tcc ttc ctt cct tcc ctc ctt tct tcc ttc ctt 399
Ile Trp Ile Phe Pro Ser Phe Leu Pro Ser Leu Leu Ser Ser Phe Leu

-15 -10 -5

ctt tcc ctc ccc cct tcc ggg 420
Leu Ser Leu Pro Pro Ser Gly
1 5 

<210> 324 
<211> 210 
<212> DNA 
<213> Homo sapiens 




<220> 
<221> CDS 
<222> 9. .209 



<221> sig_peptide 
<222> 9. .116 
<223> Von Heijne matrix 
score 5.5 

seq LHFVYCFLCCAEA/FL 



<400> 324 

ctccttat atg ttt cag tta ctg atc ctt tgt cag atg aat agt ttg aaa 50

Met Phe Gln Leu Leu Ile Leu Cys Gln Met Asn Ser Leu Lys

-35 -30 -25

ata ttt tct ccc att ctt gga tgg tct ctt cat ttt gtt tat tgt ttc 98
Ile Phe Ser Pro Ile Leu Gly Trp Ser Leu His Phe Val Tyr Cys Phe
-20 -15 -10

ctt tgc tgt gca gaa gcc ttt tta ctt gat atg atc cca ttt atg caa 146
Leu Cys Cys Ala Glu Ala Phe Leu Leu Asp Met Ile Pro Phe Met Gln
-5 1 5 10

ttt tac ttt ggt tac ctg tgc ttg tgg ggt att act tta aaa atc ttt 194
Phe Tyr Phe Gly Tyr Leu Cys Leu Trp Gly Ile Thr Leu Lys Ile Phe
15 20 25

gcc cag tcc aat tgg g 210
Ala Gln Ser Asn Trp
30


<210> 325 
<211> 192 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31. .192 



<221> sig_peptide
<222> 31. .174
<223> Von Heijne matrix
score 5.5

seq VCLRLHVLSAVQT/ER 



<400> 325 

aggctgctgc agttggcgma tgaggcgacc atg gcc ttg ctg ggt aag cgc tgt 54 

Met Ala Leu Leu Gly Lys Arg Cys 
-45 

gac gtc ccc acm aac ggc tgc gga ccc gac cgc wgg aam wac ggc gwy 102 

Asp Val Pro Thr Asn Gly Cys Gly Pro Asp Arg Xaa Xaa Xaa Gly Xaa 

-40 -35 -30 -25 

aac ccg caa ara cga gat cat cac cag cmt mgt gtc tgc ctt aga ctc 150

Asn Pro Gln Xaa Arg Asp His His Gln Xaa Xaa Val Cys Leu Arg Leu

-20 -15 -10

cat gtg ctc agc gct gtc car act gaa cgc cga ggt gat ggg 192
His Val Leu Ser Ala Val Gln Thr Glu Arg Arg Gly Asp Gly
-5 1 5



<210> 326 

<211> 181 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 




<222> 71. .181 






<221> sig_peptide
<222> 71. .166 
<223> Von Heijne matrix 
score 5.5 

seq TLALLSSDSVATG/SV 



<400> 326 

aatttcgcgg cctagtgggg cgtacgggcc tcttttgaaa gcctgagtta cgatgtattg 60 
agcgcgtcgt atg cgg cca gca cta agg tcc ttc tgg cac tcc tct ggt 109
Met Arg Pro Ala Leu Arg Ser Phe Trp His Ser Ser Gly

-30 -25 -20

gga ccg ccc cca tcg gcc aca ctt gcc ctg ctc tcc agt gat tct gta 157
Gly Pro Pro Pro Ser Ala Thr Leu Ala Leu Leu Ser Ser Asp Ser Val

-15 -10 -5

gct act ggc tcc gta gtc tcg cgg 181
Ala Thr Gly Ser Val Val Ser Arg 
1 5 



<210> 327 
<211> 185 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .185 



<221> sig_peptide
<222> 39. .116
<223> Von Heijne matrix
score 5.5

seq LFSGWLVWWGSRS/SQ

<221> misc_feature
<222> 143,145,175 
<223> n=a, g, c or t 



<400> 327 

caaagacgca ctacttagta cagagaggtt ttgaatac atg ctc tgt gca tgc aag 56

Met Leu Cys Ala Cys Lys
-25

gca cgt ggg gtg atg ctg ctg ctg ttc tca ggg tgg ttg gtt tgg tgg 104

Ala Arg Gly Val Met Leu Leu Leu Phe Ser Gly Trp Leu Val Trp Trp

-20 -15 -10 -5

ggc agt agg tcc tca cag twc ctc aga atg cct gag agn tna gta agt 152

Gly Ser Arg Ser Ser Gln Xaa Leu Arg Met Pro Glu Xaa Xaa Val Ser

1 5 10
ggg gag gga cga agc gat cdv dng cca cat ggg 185
Gly Glu Gly Arg Ser Asp Xaa Xaa Pro His Gly
15 20



<210> 328 
<211> 210 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57. .209 




<221> sig_peptide 
<222> 57. .182 
<223> Von Heijne matrix 
score 5.5

seq SLILPTSPSPAHS/GS 



<400> 328 

gacttaggts yaggcgactg cccagacaat gactggtccc gcataccgag cagagc atg 59

Met

atc agc agc agt ctg agt gga aga gtg cct gtg atc tta ggg aac ctg 107
Ile Ser Ser Ser Leu Ser Gly Arg Val Pro Val Ile Leu Gly Asn Leu

-40 -35 -30

atg ggc gtt gga gca gcg gtt cga cgc atg ggt ttc tct tta atc ctt 155
Met Gly Val Gly Ala Ala Val Arg Arg Met Gly Phe Ser Leu Ile Leu
-25 -20 -15 -10

ccg act tcc cca agc cca gcg cac tca ggt tcc gct cca agt gcg gga 203
Pro Thr Ser Pro Ser Pro Ala His Ser Gly Ser Ala Pro Ser Ala Gly

-5 1 5
ccc cgc g 210 
Pro Arg 



<210> 329 

<211> 318 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 149. .316 



<221> sig_peptide
<222> 149..286
<223> Von Heijne matrix
score 5.5

seq ILLISTLFYSLLS/GS 



<400> 329 

acacttaacc catctgtttt ctctaatgca cgacagattc ctttcagaca ggacaactgt 60
gatatttcag ttcctgattg taaatacctc ctaagcctga agcttctgtt actagccatt 120
gtgrgcttca gtktcttcak yckgcaaa atg ggc ata ata car kct att ctt 172

Met Gly Ile Ile Gln Xaa Ile Leu
-45 -40
gcc aca tca agg gat tgt tat tcc ttt aaa aaa aaa cca ata cca aag 220
Ala Thr Ser Arg Asp Cys Tyr Ser Phe Lys Lys Lys Pro Ile Pro Lys

-35 -30 -25

aag cct aca atg ttg gcc tta gcc aaa att ctg ttg att tca acg ttg 268
Lys Pro Thr Met Leu Ala Leu Ala Lys Ile Leu Leu Ile Ser Thr Leu

-20 -15 -10

ttt tat tca ctt cta tcg ggg agc cat gga aaa gra aat caa gac gtg 316
Phe Tyr Ser Leu Leu Ser Gly Ser His Gly Lys Xaa Asn Gln Asp Val

-5 1 5 10

gg 318 



<210> 330 

<211> 223 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 135. .221 



<221> sig_peptide 




<222> 135. .203 
<223> Von Heijne matrix 
score 5.5 

seq LPFVCLLLRNVYS/DL 



<400> 330 

aacagtgtgt gagagttccc tttcctccac atcctcgcca gcatctgtta ttgcctgtct 60 
ttttgatacg agccttttta acaggggtaa gatgatatct cattgtagtt ttgatttgca 120 
ttctctgatg atca atg atg ttg agc acc ttt tca tat gcc tgt ttg cca 170
Met Met Leu Ser Thr Phe Ser Tyr Ala Cys Leu Pro
-20 -15
ttt gta tgt ctt ctt ttg aga aat gtc tat tca gat ctt ttg ccc aat 218
Phe Val Cys Leu Leu Leu Arg Asn Val Tyr Ser Asp Leu Leu Pro Asn

-10 -5 1 5

cgg gg 223
Arg 



<210> 331 
<211> 362 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 272. .361 



<221> sig_peptide 
<222> 272. .343 
<223> Von Heijne matrix 
score 5.5 

seq LIVVLVCISLVII/DD



<400> 331 

aatggacacc taggttgctt ccatatctga gctattgtga ataatgctgc aatgaacatg 60 
ggagtggaga catctcctaa gcatactgat ttcagttcct ttgggtatat acccagaagt 120 
gggatcatgt ggtaatcttg tttttacttt tttgaggaac ctccatacca ttatccatga 180 
tggctatagt aatttacatt cataccagca gtgcacaagg gtctcctttt ctgtatacac 240
ttgccaacac ttgttatctt tcattttttt g atg cta gcc att cta aca ggt 292

Met Leu Ala Ile Leu Thr Gly
-20

ggg agg tgg tat ctc ata gtg gtt tta gtt tgc att tcc ttg gtg att 340
Gly Arg Trp Tyr Leu Ile Val Val Leu Val Cys Ile Ser Leu Val Ile

-15 -10 -5

att gat gat gat gag cac ggg g 362
Ile Asp Asp Asp Glu His Gly
1 5 



<210> 332

<211> 89 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 
<222> 34. .87 



<221> sig_peptide 
<222> 34 . .75 
<223> Von Heijne matrix 
score 5.5 

seq LLPLGLKVLGLQA/RG 



<400> 332 
cccagaccgg tcttgaactc ctggcctcaa ctg atg ctc ctg cct ctg ggt ctc 54

Met Leu Leu Pro Leu Gly Leu
-10

aaa gtg ctg gga tta cag gcg aga ggc acc acg ct 89
Lys Val Leu Gly Leu Gln Ala Arg Gly Thr Thr
-5 1 

<210> 333 
<211> 399 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 255. .398 

<221> sig_peptide 
<222> 255. .338 
<223> Von Heijne matrix 
score 5.5 

seq PTLLVMWLSPQMA/SS 
<400> 333 

ttcactgcaa ggcggcggca ggagaggttg tggtgctagt ttctctaagc catccagtgc 60 
catcctcgtc gctgcagcga cacacgctct cgccgccgcc atgactgagc agatgaccct 120 
tcgtggcacc ctcaagggcc acaacggctg ggtaacccag atcgctacta ccccgcagtt 180 
cccggacatg atcctctccg cctctcgagg tacggactaa gataagacca tcatcatgtg 240 
gaaactgacc aggg atg aga cca act atg gaa ttc cac agc gtg ctc tgc 290 
Met Arg Pro Thr Met Glu Phe His Ser Val Leu Cys 
-25 -20 
ggg gtc act ccc act ttg tta gtg atg tgg tta tct cct cag atg gcc 338 
Gly Val Thr Pro Thr Leu Leu Val Met Trp Leu Ser Pro Gln Met Ala 

-15 -10 -5 

agt tcg ccc tct cag gct cct ggg atg gaa ccc tgc gcc tct ggg atc 386 
Ser Ser Pro Ser Gln Ala Pro Gly Met Glu Pro Cys Ala Ser Gly Ile 
1 5 10 15 

tca caa cgg gca a 399 
Ser Gln Arg Ala 
20 

<210> 334 
<211> 188 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 33. .188 

<221> sig_peptide 
<222> 33. .131 
<223> Von Heijne matrix 
score 5.5 

seq SLCLLTVAVLVLT/PK 

<400> 334 

aatgaagggt actagaacac ctgcccatcc at atg gga aaa aaa aaa atc tgg 53 

Met Gly Lys Lys Lys Ile Trp 
-30 

acc cct agc tca tat ccc atg ccc agt cat aaa cat gta tcc cta tgt 101 
Thr Pro Ser Ser Tyr Pro Met Pro Ser His Lys His Val Ser Leu Cys 

-25 -20 -15 

ctt cta acg gtt gca gtt tta gtt ctt aca ttt aag tct tta att cat 149 

Leu Leu Thr Val Ala Val Leu Val Leu Thr Phe Lys Ser Leu Ile His 
-10 -5 1 5 

ttt gag tda att ttt gca tat gag ata ggg gtc cag ggg 188 
Phe Glu Xaa Ile Phe Ala Tyr Glu Ile Gly Val Gln Gly 
10 15 

<210> 335 
<211> 115 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23. .115 

<221> sig_peptide 
<222> 23. .94 
<223> Von Heijne matrix 
score 5.5 

seq CPSLLSPISPSQA/CP 
<400> 335 

ccaatacaca tcactcagtg gc atg agc cct gtc ctc tgc ttc cat cgc tgc 52 

Met Ser Pro Val Leu Cys Phe His Arg Cys 
-20 -15 
tcc tgt ccc tcc ctc ctc agc ccc atc tcc cca tcc cag gcc tgt cct 100 
Ser Cys Pro Ser Leu Leu Ser Pro Ile Ser Pro Ser Gln Ala Cys Pro 

-10 -5 1 

gag ccc ctc ctt ggg 115 
Glu Pro Leu Leu Gly 
5 

<210> 336 
<211> 300 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 197. .298 

<221> sig_peptide 
<222> 197. .268 
<223> Von Heijne matrix 
score 5.5 

seq IMFVCMCVCVCVC/VY 
<400> 336 

catgcttgtt gtaacgtgtc aaacaataca gaggtgtagg gaaaatacct agtgccaccc 60 
tccactccaa aaccccatgt cgccagagat aaccatttat tcagacagtg agtatctatt 120 
aagtatctat tgctaggctt tggagatagc ataatgaaca aaatggatgt gctctctgcc 180 
cttgtgattt ggacag atg ctt cag tta tct ttt tct gtg ttt ata ttg att 232 
Met Leu Gln Leu Ser Phe Ser Val Phe Ile Leu Ile 
-20 -15 
atg ttt gta tgt atg tgc gtg tgt gtg tgt gtg tgt gtg tat cga ctg 280 
Met Phe Val Cys Met Cys Val Cys Val Cys Val Cys Val Tyr Arg Leu 

-10 -5 1 

ttt tct tcc tcc tcc ccg gg 300 
Phe Ser Ser Ser Ser Pro 
5 10 



<210> 337 
<211> 307 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 5. .307 

<221> sig_peptide 
<222> 5. .277 
<223> Von Heijne matrix 
score 5.5 

seq RVLLGAGIPPVSS/AP 

<400> 337 

caca atg aag tcg act gtt tcg tcg agg gaa gtg gcc acc gtt gat aaa 49 
Met Lys Ser Thr Val Ser Ser Arg Glu Val Ala Thr Val Asp Lys 
-90 -85 -80 
atg aaa aga cgc cat gca gaa tac tgt gca cag ggt ctc cag aga ttt 97 
Met Lys Arg Arg His Ala Glu Tyr Cys Ala Gln Gly Leu Gln Arg Phe 
-75 -70 -65 
aaa gcc caa ctt tct caa gat acc ctt ccc cav cat cca cat ctg gag 145 
Lys Ala Gln Leu Ser Gln Asp Thr Leu Pro Xaa His Pro His Leu Glu 
-60 -55 -50 -45 
awa gag aag ggg ctt gaa ggc ttg gag gaa aat gtg cct cta aag gga 193 
Xaa Glu Lys Gly Leu Glu Gly Leu Glu Glu Asn Val Pro Leu Lys Gly 
-40 -35 -30 
gag aaa cct gga gaa ggg ggt cca gag tct cct aag aag aga aga agg 241 
Glu Lys Pro Gly Glu Gly Gly Pro Glu Ser Pro Lys Lys Arg Arg Arg 
-25 -20 -15 
gtg ctt ctc gga gcg ggc atc cca cca gta agc tca gct ccc agg aga 289 
Val Leu Leu Gly Ala Gly Ile Pro Pro Val Ser Ser Ala Pro Arg Arg 
-10 -5 1 
cag agc cag cag gca aca 307 
Gln Ser Gln Gln Ala Thr 
5 10 

<210> 338 
<211> 123 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .123 

<221> sig_peptide 
<222> 16. .75 
<223> Von Heijne matrix 
score 5.5 

seq VHLFFFFFXETGS/RS 
<400> 338 

ttaattaaac tgtgg atg cac aac agt tgt aga cct gtg cac ctt ttt ttc 51 
Met His Asn Ser Cys Arg Pro Val His Leu Phe Phe 
-20 -15 -10 

ttt ttt ttt yct gag aca ggt tct cgt tct aat ycc tgg ctg gag tsc 99 
Phe Phe Phe Xaa Glu Thr Gly Ser Arg Ser Asn Xaa Trp Leu Glu Xaa 

-5 1 5 

agt ggt gcg atc ata gct aac tcc 123 
Ser Gly Ala Ile Ile Ala Asn Ser 
10 15 



<210> 339 

<211> 451 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 318. .449 



<221> sig_peptide 

<222> 318. .443 

<223> Von Heijne matrix 

score 5.40000009536743 
seq TFRLYLSLPVSQA/GP 

<221> misc_feature 
<222> 310. .311,394 
<223> n=a, g, c or t 



<400> 339 

gtcacaaaag gagcactaag agcctgcttt actttcttcc tcagttgagt cgtggggaca 60 
gcttgaagga gccaacctca attgcagaga gcagccgtca ccccagctac cgctcagagc 120 
ccagcttgga accagagagc ttccgttctc ctacctttgg caaaagtttt cacttcgatc 180 
cactatccag tggctcacgc tcctccagcc tcaagtcagc ccagggcaca ggctttgagc 240 
tgggccagtt gcaatccatt cgttcagagg gcaccacctc cacctcctaa taagagcctg 300 
gccaaccagn nacgcaa atg gaa gcc tat ctt aat gac agc ttg ctc aca 350 
Met Glu Ala Tyr Leu Asn Asp Ser Leu Leu Thr 
-40 -35 
cct tca gac agc cct gat ttt gag tca gtg cag gca ggg cct gna gcc 398 
Pro Ser Asp Ser Pro Asp Phe Glu Ser Val Gln Ala Gly Pro Xaa Ala 

-30 -25 -20 

aga ccc acc ttt agg cta tac ctc tcc ctt cct gtc agc cag gct ggc 446 
Arg Pro Thr Phe Arg Leu Tyr Leu Ser Leu Pro Val Ser Gln Ala Gly 
-15 -10 -5 1 

cca gc 451 
Pro 



<210> 340 
<211> 304 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 94. .303 



<221> sig_peptide 

<222> 94. .135 

<223> Von Heijne matrix 

score 5.40000009536743 
seq PALGPALLQGSLX/RV 

<221> misc_feature 
<222> 244. .245 
<223> n=a, g, c or t 



<400> 340 

gcgcagggga gaaacaaggc gccttggagt tcaggtgact cccacacggg tcatgctgtt 60 
gtctcctgat ccagccggcc ctgccaggtg acc atg cct gct ctg ggc cca gct 114 
Met Pro Ala Leu Gly Pro Ala 
-10 
ctt ctc cag ggc tct ctg kgc cgv gtg ggt cct cac cct cca gcs cct 162 
Leu Leu Gln Gly Ser Leu Xaa Arg Val Gly Pro His Pro Pro Ala Pro 
-5 1 5 
tcc acc aac tgc att cac tcc caa tgg cac gta tct gca gca csk ggc 210 
Ser Thr Asn Cys Ile His Ser Gln Trp His Val Ser Ala Ala Xaa Gly 
10 15 20 25 
aag gga ccc cac ctc agg cac cct ctr sct ggg nns tac caa ctt cct 258 
Lys Gly Pro His Leu Arg His Pro Leu Xaa Gly Xaa Tyr Gln Leu Pro 
30 35 40 
gtt cca gct gag ccc tgg gct gca gct gga ggc cac agt gtc cac c 304 
Val Pro Ala Glu Pro Trp Ala Ala Ala Gly Gly His Ser Val His 
45 50 55 

<210> 341 

<211> 379 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 315. .377 

<221> sig_peptide 

<222> 315. .371 

<223> Von Heijne matrix 

score 5.40000009536743 

seq LCCSGCVPSLCCS/SY 

<400> 341 

gtagccgccg ccgaaacttc cgccgccgcg tccgccgcct ccggaactaa acggggtgag 60 
gtcacattcg gttatctcta acgttggaaa acgatggagc taacacccat tatggagatt 120 
aamcvacttt tcatcaggtt tttaacttaa gtcgtgagga atacaacggt gaacacaaga 180 
ttcattttat tttcatcacc atgggacgta tcctgttgtt gagttctctg ggtcagacct 240 
ctgaagactt ctcagatgga tcctagtctc wrrgcttgcc ctgaaattac tcgctgctca 300 
gggagagagt tgaa atg gtt ggc atc ctc cca ctc tgt tgc tcc ggc tgt 350 
Met Val Gly Ile Leu Pro Leu Cys Cys Ser Gly Cys 
-15 -10 
gtc ccc tcg ctc tgt tgt tcc agc tat gt 379 
Val Pro Ser Leu Cys Cys Ser Ser Tyr 
-5 1 

<210> 342 
<211> 289 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 223. .288 

<221> sig_peptide 

<222> 223. .264 

<223> Von Heijne matrix 

score 5.40000009536743 

seq AHSILLLASQAGC/LR 

<400> 342 

gggacccttt tagctatgaa atattttgga ttgcgtaggg tcttgcgcag cgcgaaaagt 60 
agcgtgggcc aggacagcgg gaggtaagtc gccaagaaaa gggttgggaa ragctcagaa 120 
tcggacggct aggaagaaat gaccaaaagg agcctgatag ccccctattc tgcacgctgt 180 
tcctggaaac cgcctttgca aagacagtga gagaaatcta ac atg gct cac tcc 234 

Met Ala His Ser 

atc ttg ctt cta gcc tcg cag gcc ggc tgt ctt cgc tca ttc ctg ggc 282 
Ile Leu Leu Leu Ala Ser Gln Ala Gly Cys Leu Arg Ser Phe Leu Gly 



-10 

aat tgg g 289 
Asn Trp 



<210> 343 
<211> 169 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78. .167 



<221> sig_peptide 

<222> 78. .137 

<223> Von Heijne matrix 

score 5.40000009536743 

seq WVFLVAIFKGVHC/EG 

<400> 343 

agctctggga gaggagcccc cgccctggga ttcccaggtg ttttcatttg gtgatcagca 60 
ctgaacacag aagagtc atg acg gag ttt ggg ctg agc tgg gtt ttc ctt 110 
Met Thr Glu Phe Gly Leu Ser Trp Val Phe Leu 
-20 -15 -10 

gtt gct att ttt aaa ggt gtc cac tgt gaa ggt cma att ggt gga gtc 158 
Val Ala Ile Phe Lys Gly Val His Cys Glu Gly Xaa Ile Gly Gly Val 
-5 1 5 

ggg ggg gcg gg 169 

Gly Gly Ala 
10 



<210> 344 
<211> 112 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63. .110 



<221> sig_peptide 

<222> 63. .104 

<223> Von Heijne matrix 

score 5.40000009536743 

seq NTVFLLLFFGCFF/FE 

<400> 344 

tgtgttttct ctgtcccaaa ttaaatgcat tggggaagtt tataattaca ggaattccac 60 
go atg aac act gtt ttt ttg ttg ttg ttt ttt ggt tgt ttt ttt ttt 107 
Met Asn Thr Val Phe Leu Leu Leu Phe Phe Gly Cys Phe Phe Phe 
-10 -5 1 

gag ac 112 
Glu 



<210> 345 
<211> 349 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 207. .347 




<221> sig_peptide 

<222> 207. .278 

<223> Von Heijne matrix 

score 5.40000009536743 

seq SCCCLSSSSFIAG/RR 



<400> 345 

tcatcgtcta cgtggacggc agctggagcc cgtggagcaa gtggtcggcc tgtgggctgg 60 
actgcaccca ctggcggacc gtgagtgctc tgacccagca ccccgcaacg gaggggagga 120 
gtgccagggc actgacctgg acacccgcaa ctgtaccagt gacctctgtg tacacactgc 180 
ttctggccct gaggacgtgg ccctct atg tgg gcc tca tcg ccg tgg ccg tct 233 

Met Trp Ala Ser Ser Pro Trp Pro Ser 
-20 

gcb tgg tcc tgc tgc tgc ttg tcc tca tcc tcg ttt att gcc gga aga 281 

Ala Trp Ser Cys Cys Cys Leu Ser Ser Ser Ser Phe Ile Ala Gly Arg 

-15 -10 -5 1 

agg agg ggc tgg act cag atg tgg ctg act cgt cca ttc tca cct cag 329 

Arg Arg Gly Trp Thr Gln Met Trp Leu Thr Arg Pro Phe Ser Pro Gln 

5 10 15 

gct tcc agc ccg tca gca tc 349 
Ala Ser Ser Pro Ser Ala 
20 



<210> 346 
<211> 191 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 45. .191 



<221> sig_peptide 

<222> 45. .143 

<223> Von Heijne matrix 

score 5.40000009536743 
seq FMLIILSAILLNS/FI 



<400> 346 

ccttccatag gtggtgtacc gttttgcatt cccatcagca ctgt atg aca atg ccc 56 

Met Thr Met Pro 
-30 

att tct tca tat tcc cag aat gtg ttg tca aac ttt cac gat ggc tat 104 
Ile Ser Ser Tyr Ser Gln Asn Val Leu Ser Asn Phe His Asp Gly Tyr 

-25 -20 -15 

ttt atg tta att ata ctt tct gcc att tta cta aat tct ttt att ggt 152 
Phe Met Leu Ile Ile Leu Ser Ala Ile Leu Leu Asn Ser Phe Ile Gly 

-10 -5 1 

tgt gtc agc ttt tat cat tgc ttt tct tgg ggt tca ggg 191 
Cys Val Ser Phe Tyr His Cys Phe Ser Trp Gly Ser Gly 
5 10 15 



<210> 347 
<211> 229 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 144. .227 

<221> sig_peptide 
<222> 144. .203 




<223> Von Heijne matrix 

score 5.40000009536743 
seq LSLVIFLLTVKHC/FR 

<400> 347 

tttcatatag ccacctcctc ttggtagcca gaagaccctt cggatgatgc cccaggtgta 60 
aaactctctg gggcccgccc cactcggaag gattactgaa atgagtcatt tccgggacgc 120 
cttttttact gttgaatgaa agg atg cta aca cat ggg gct tcc ctg tct tta 173 

Met Leu Thr His Gly Ala Ser Leu Ser Leu 

-20 -15 
gtc ata ttt ctg tta aca gtg aag cat tgc ttt aga tac aga gta tac 221 
Val Ile Phe Leu Leu Thr Val Lys His Cys Phe Arg Tyr Arg Val Tyr 
-10 -5 1 5 

aag act tt 229 
Lys Thr 

<210> 348 
<211> 210 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 106. .210 

<221> sig_peptide 
<222> 106. .171 
<223> Von Heijne matrix 

score 5.40000009536743 

seq FWTSIPILPLSSG/RQ 

<400> 348 

aaagaatcca gttgagccta tcgggacttt tgacctacag aactgtgaga taaaaaatgg 60 
gtgtcgtttt agataaccca tggcagcatt ccctcctctg ctgga atg tcg tca gtg 117 

Met Ser Ser Val 
-20 

gag act gac tgg gga ttc tgg act tcc atc ccc atc ctc cca ctc agc 165 
Glu Thr Asp Trp Gly Phe Trp Thr Ser Ile Pro Ile Leu Pro Leu Ser 

-15 -10 -5 

agt ggt agg cag ctc ccc ctc ccc act aga gaa tgg gga atg tgg 210 
Ser Gly Arg Gln Leu Pro Leu Pro Thr Arg Glu Trp Gly Met Trp 
1 5 10 

<210> 349 
<211> 431 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 184. .429 

<221> sig_peptide 

<222> 184. .282 

<223> Von Heijne matrix 

score 5.40000009536743 

seq LSAILSMLSLSFS/TT 

<221> misc_feature 
<222> 214 

<223> n=a, g, c or t 



<400> 349 

^SS^catcct ctccaatcca ccacacacca ccttacccct ctgctggcaa gaggggacct 60 

gattcatcct cacgctaaac actcattcta cccaactgat tgagacagaa cagaagataa 120 

ctgaaacttc tctgccttcc cgctgcaaga agtgaatgag cgatccctct caactgactk 180 

raa atg ttt gcc tca ccc agg aga tgg agc tct ncg aag gcc ttc tct 228 
Met Phe Ala Ser Pro Arg Arg Trp Ser Ser Xaa Lys Ala Phe Ser 
-30 -25 -20 
ggc cag ^gg aca ctc cta tct gcc atc ctc agc atg cta tca ctc agc 276 
Gly Gln Arg Thr Leu Leu Ser Ala Ile Leu Ser Met Leu Ser Leu Ser 
-15 -10 -5 
ttc tcc aca aca tcc ctg ctc agc aac tac tgg ttt gtg ggc aca cag 324 
Phe Ser Thr Thr Ser Leu Leu Ser Asn Tyr Trp Phe Val Gly Thr Gln 
1 5 10 
aag gtg ccc aag ccc ctg tgc gag aaa ggt ctg gca gcc aag tgc ttt 372 
Lys Val Pro Lys Pro Leu Cys Glu Lys Gly Leu Ala Ala Lys Cys Phe 
15 20 25 30 
gac atg cca gtg tcc ctg gat gat acc aac aca tcc acc cag gag 420 
Asp Met Pro Val Ser Leu Asp Gly Asp Thr Asn Thr Ser Thr Gln Glu 
35 40 45 
gtg gta mma ta 431 
Val Val Xaa 

<210> 350 
<211> 386 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 197. .385 

<221> sig_peptide 

<222> 197. .244 

<223> Von Heijne matrix 

score 5.40000009536743 

seq HSVFLCAPALVFP/RP 

<400> 350 

aaagtaaagc ggaggcagcg ggggaagatg gcggcggccg ttccacagcg ggcgtggacc 60 
gtggagcagc tgcgcagtga gcagctgccc aagaaggaca ttatcaagtt tctgcaggaa 120 
cacggttcag attcggtacc agaggcgtag gggcggccgg gctggtgcgg ctgagggacg 180 
cctcaccccc ctggag atg ccc ata cat tcc gta ttc ctc tgt gcc ccc gcc 232 
Met Pro Ile His Ser Val Phe Leu Cys Ala Pro Ala 
-15 -10 -5 

ctc gtc ttc ccg cgg ccg gtg gcc tgg aag gcg gag agg ccc agc ttg 280 
Leu Val Phe Pro Arg Pro Val Ala Trp Lys Ala Glu Arg Pro Ser Leu 

1 5 10 
tgc ttt ggt gcc tcg ctc ccg cct ctc ggg cgt tct cta ctg ggg cag 328 
Cys Phe Gly Ala Ser Leu Pro Pro Leu Gly Arg Ser Leu Leu Gly Gln 

15 20 25 

ggg agc agc ttt att tct tgg ggc aca cag gct gca att gta gag tta 376 
Gly Ser Ser Phe Ile Ser Trp Gly Thr Gln Ala Ala Ile Val Glu Leu 

30 35 40 

kaa cct cat t 386 
Xaa Pro His 
45 

<210> 351 
<211> 307 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> CDS 
<222> 68. .307 

<221> sig_peptide 

<222> 68. .253 

<223> Von Heijne matrix 

score 5.30000019073486 

seq LACVFFLSHPLFX/XP 

<221> misc_feature 
<222> 279 

<223> n=a, g, c or t 



<400> 351 

ttttactctg taattgttac taattgattt ttgmataggg agcacattcc catggttcaa 60 



aattcaa atg gta tac gat gaa aaa tct ctc tcc tgt tcc cat acc cca 109 
Met Val Tyr Asp Glu Lys Ser Leu Ser Cys Ser His Thr Pro 
-60 -55 -50 
gcc acc cag ttc ctc tcc tgg gat gca tcc agt gtt tac agt ttc tta 157 
Ala Thr Gln Phe Leu Ser Trp Asp Ala Ser Ser Val Tyr Ser Phe Leu 
-45 -40 -35 
tat atc ctc tca gca aga gtt aat gta gac gta dgc agm tac att cgt 205 
Tyr Ile Leu Ser Ala Arg Val Asn Val Asp Val Xaa Xaa Tyr Ile Arg 
-30 -25 -20 
gtg tac ata ctt gcc tgt gtg ttt ttc ctc tca cac ccc ctt ttt aad 253 
Val Tyr Ile Leu Ala Cys Val Phe Phe Leu Ser His Pro Leu Phe Xaa 
-15 -10 -5 
sra cca aat ggt agt gta tat tgt cnm cgt cat tct ccc cct tac ctt 301 
Xaa Pro Asn Gly Ser Val Tyr Cys Xaa Arg His Ser Pro Pro Tyr Leu 
1 5 10 15 
ttt tgc 307 
Phe Cys 

<210> 352 
<211> 170 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 56. .169 

<221> sig_peptide 

<222> 56. .163 

<223> Von Heijne matrix 

score 5.30000019073486 

seq VCLLIISLVLISG/LG 

<400> 352 

gttctttggt gatacaaaca ctgtattttg agtaatcttt tccctatatt tcgaa atg 58 

Met 

ctg cct tta tca cct act aaa ttc cta aat gtg ttc ttg ggc ctg ttc 106 

Leu Pro Leu Ser Pro Thr Lys Phe Leu Asn Val Phe Leu Gly Leu Phe 

-35 -30 -25 -20 

ctc tat tat ctt caa ttg gta tgt ctg ctt att att tct ttg gtt ttg 154 

Leu Tyr Tyr Leu Gln Leu Val Cys Leu Leu Ile Ile Ser Leu Val Leu 

-15 -10 -5 

ata tct ggg tta ggg g 170 
Ile Ser Gly Leu Gly 
1 



<210> 353 

<211> 293 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 149. .292 

<221> sig_peptide 

<222> 149. .235 

<223> Von Heijne matrix 

score 5.30000019073486 

seq LNQTLMLLREVLA/SH 

<400> 353 

tttctaatct sbtcaaattt tatcaccata caatcagtgg taktgttgga aatagtgcaa 60 

ctgcattatt gactaccatt gaagaaatgc atttgctaag caaaaaaata ttcttcaatt 120 

agcttgaagt cttcatgcaa gtaaatta atg gac aag gtt gaa ctc cca cca 172 

Met Asp Lys Val Glu Leu Pro Pro 
-25 

act gat ctt gga cca agt tct gca cta aat cag aca ctc atg ttg ctg 220 
Pro Asp Leu Gly Pro Ser Ser Ala Leu Asn Gln Thr Leu Met Leu Leu 
-20 -15 -10 
cgt gaa gtt tta gca tct cac gat tct tca gtk gta cca tta gat gct 268 
Arg Glu Val Leu Ala Ser His Asp Ser Ser Val Val Pro Leu Asp Ala 
-5 1 5 10 
cgt caa gct gat ttt gtg cag ggg g 293 
Arg Gln Ala Asp Phe Val Gln Gly 
15 

<210> 354 
<211> 331 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 148. .330 

<221> sig_peptide 

<222> 148. .243 

<223> Von Heijne matrix 

score 5.30000019073486 
seq LVWLWFVVPQTIT/MI 

<221> misc_feature 
<222> 124 

<223> n=a, g, c or t 



<400> 354 

catttctagc ttttgdktta aagtgacaga cttgccactc ttcttttccc ttgaacactt 60 
acaggctgtg ggagggttat tagttggtct aatttcaata mtgttccttt cyccagggaa 120 
ttgnraggcc caaggagagg gagagag atg ggg gga aca gct ggt tgg agc agt 174 

Met Gly Gly Thr Ala Gly Trp Ser Ser 
-30 -25 
cag aac aca cac aac att kga gta cac cat ctt gtg tgg ctg tgg ttc 222 
Gln Asn Thr His Asn Ile Xaa Val His His Leu Val Trp Leu Trp Phe 

-20 -15 -10 

gtg gtc ccc caa aca att aca atg ata aca cca aag atc act gaa cac 270 
Val Val Pro Gln Thr Ile Thr Met Ile Thr Pro Lys Ile Thr Glu His 

-5 1 5 
aga cca sta ata aca gat atr dtr ata atg aya aca ttt gaa awa ttg 318 




Arg Pro Xaa Ile Thr Asp Xaa Xaa Ile Met Xaa Thr Phe Glu Xaa Leu 

10 15 20 25 

gga gaa tta ccc a 331 

Gly Glu Leu Pro 



<210> 355 

<211> 93 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2. .91 



<221> sig_peptide 
<222> 2. .55 

<223> Von Heijne matrix 

score 5.30000019073486 
seq ALYLCVCVCVCLI/AR 



<400> 355 

t atg tgt ctv agt gta get ttg tat tta tgt gtg tgt gtg tgt gta tgt 49 
Met Cys Leu Ser Val Ala Leu Tyr Leu Cys Val Cys Val Cys Val Cys 
-15 -10 -5 

ctg att gca cgg gtg tac ttt tgt att tat gtg tgt gtg tgg tt 93 
Leu Ile Ala Arg Val Tyr Phe Cys Ile Tyr Val Cys Val Trp 
1 5 10 



<210> 356 
<211> 178 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 92. .178 

<221> sig_peptide 

<222> 92. .133 

<223> Von Heijne matrix 

score 5.30000019073486 
seq LHLLFGLFPVLWM/FL 



<400> 356 

tgacccttgt ccagtctttt ccaggaaaaa catgccctca agatgttttt ctatcttgag 60 
gaaatgatgg aaatgagata gttccaaggg t atg ctt cac ctt ctt ttt ggc 112 

Met Leu His Leu Leu Phe Gly 
-10 

tta ttt cct gtt ctt tgg atg ttt cta gtg tat ttc ttt ctt tct tct 160 
Leu Phe Pro Val Leu Trp Met Phe Leu Val Tyr Phe Phe Leu Ser Ser 

-5 1 5 
ttt ttt ttt ttt ttt ttt 178 
Phe Phe Phe Phe Phe Phe 
10 15 



<210> 357 
<211> 107 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 40. .105 



<221> sig_peptide 

<222> 40. .93 

<223> Von Heijne matrix 

score 5.30000019073486 
seq CVYLFCACMCVCA/FF 

<221> misc_feature 
<222> 54 

<223> n=a, g, c or t 



<400> 357 

tatatttata taaatatata taaatacaca catatatat atg tat gtg tgt atn 54 

Met Tyr Val Cys Xaa 

-15 

tgt gtg tat ctt ttt tgt gca tgt atg tgt gta tgt gct ttt ttt ttt 102 
Cys Val Tyr Leu Phe Cys Ala Cys Met Cys Val Cys Ala Phe Phe Phe 

-10 -5 1 

ttt tt 107 
Phe 



<210> 358 
<211> 209 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 44. .208 



<221> sig_peptide 

<222> 44. .151 

<223> Von Heijne matrix 

score 5.30000019073486 
seq FLFTLIGASLLQS/AS 



<400> 358 

ggggggcgac gtaagggcgc tccgcgagcc cgtctctcct cga atg aaa sga aac 55 

Met Lys Xaa Asn 
-35 

aac ctc cgg cga cag agc ccc gct ctc agg cac tgc tgg aga mcc gag 103 
Asn Leu Arg Arg Gln Ser Pro Ala Leu Arg His Cys Trp Arg Xaa Glu 

-30 -25 -20 

acc gac ttc ttt ctc ttt acc ctc att ggc gct tct ctc ctg cag tcc 151 
Thr Asp Phe Phe Leu Phe Thr Leu Ile Gly Ala Ser Leu Leu Gln Ser 

-15 -10 -5 

gcc tct ggg ccc tgc cgc att tct tsa smc tta aag tgg cat tct aaa 199 
Ala Ser Gly Pro Cys Arg Ile Ser Xaa Xaa Leu Lys Trp His Ser Lys 
1 5 10 15 

ggc act tta a 209 
Gly Thr Leu 



<210> 359 
<211> 298 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 135. .296 



<221> sig_peptide 
<222> 135. .194 

<223> Von Heijne matrix 

score 5.30000019073486 

seq LGLGLPLLPPNHP/SV 



<400> 359 

agcatttgcm ttcggagggc cagargggca ggcagagctt aattccttgg gcaaggctgg 60 
ggctgttgga atggggtctg gaggccagga gccaccctgt ctgggccaga aaggggcctk 120 
ggtgcagggc aggc atg tgg ccc aag arg ggg cta ctg gga ttg ggg ctc 170 
Met Trp Pro Lys Xaa Gly Leu Leu Gly Leu Gly Leu 
-20 -15 -10 

cca ctg ctg ccc cct aac cat ccc tcg gta gcc caa ggg aca ctc gtt 218 
Pro Leu Leu Pro Pro Asn His Pro Ser Val Ala Gln Gly Thr Leu Val 

-5 1 5 

tcc tcc cac tct ggt tct ggc tct gag ggt agg gtg gcg ctc agg agt 266 
Ser Ser His Ser Gly Ser Gly Ser Glu Gly Arg Val Ala Leu Arg Ser 

10 15 20 

gat gtc cac agc ccc aag aca acc csc caa cg 298 
Asp Val His Ser Pro Lys Thr Thr Xaa Gln 
25 30 



<210> 360 
<211> 460 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 54 . .458 



<221> sig_peptide 

<222> 54. .179 

<223> Von Heijne matrix 

score 5.30000019073486 
seq AMAXLFLSAPPQA/EV 

<221> misc_feature 
<222> 150,285,328 
<223> n=a, g, c or t 



<400> 360 

gaggttgggc tgccgtgctg ctcggcggcg ctgaggccaa atagttgcat cac atg 56 
Met 
tat cta atc cga gag tct cat gct tct ggt agc tcc tca gtg acc agc 104 
Tyr Leu Ile Arg Glu Ser His Ala Ser Gly Ser Ser Ser Val Thr Ser 
-40 -35 -30 
tcc tgc tca ctg mcc tca gra agc ccc aac cct cag gca atg gck ncc 152 
Ser Cys Ser Leu Xaa Ser Xaa Ser Pro Asn Pro Gln Ala Met Ala Xaa 
-25 -20 -15 -10 
ttg ttc ctg tct gcc cca ccc cag gcc gag gtg acc ttc gag gac gtg 200 
Leu Phe Leu Ser Ala Pro Pro Gln Ala Glu Val Thr Phe Glu Asp Val 
-5 1 5 
gct gtg tac ctc tcc cgg gag gaa tgg ggc cgc ctg ggc cct gct cag 248 
Ala Val Tyr Leu Ser Arg Glu Glu Trp Gly Arg Leu Gly Pro Ala Gln 
10 15 20 
agg ggc bkc tac agg gac gtg atg ctg gag acc tac ngg aac bta gtc 296 
Arg Gly Xaa Tyr Arg Asp Val Met Leu Glu Thr Tyr Xaa Asn Xaa Val 
25 30 35 
tca ctg gga gta gga cct gca ggc ccc aag cnt gga gtg atc tcg cag 344 
Ser Leu Gly Val Gly Pro Ala Gly Pro Lys Xaa Gly Val Ile Ser Gln 
40 45 50 55 
ttg gag cga ggg gat gag ccc tgg gtc ctg gat gtt cag ggc acc tct 392 
Leu Glu Arg Gly Asp Glu Pro Trp Val Leu Asp Val Gln Gly Thr Ser 

60 65 70 

ggg aaa gag cac ctg aag aag tca aca gcc cag ctc ttg gga cca gaa 440 

Gly Lys Glu His Leu Lys Lys Ser Thr Ala Gln Leu Leu Gly Pro Glu 

75 80 85 

ctg aag tac aag gag ttg ay 460 
Leu Lys Tyr Lys Glu Leu 
90 



<210> 361 
<211> 318 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 153. .317 



<221> sig_peptide 

<222> 153. .263 

<223> Von Heijne matrix 

score 5.30000019073486 
seq ALSSLCVSWGTSS/TV 



<400> 361 

ctcttttccg gttaacgcgg cgtgagaagc catgagcagc aaagtctctc gcgacaccct 60 
gtacgaggcg gtgcgggaag tcctgcacgg gaaccagcgc aasgccgcaa gttcctggag 120 
acggtggagt tgcaggatca gcttgaagaa ct atg atc ccc aga agg aca agc 173 

Met Ile Pro Arg Arg Thr Ser 
-35 

gct tct cgg gca ccg tca gtc ccc caa aac gca ggc tta agt cca ctc 221 

Ala Ser Arg Ala Pro Ser Val Pro Gln Asn Ala Gly Leu Ser Pro Leu 

-30 -25 -20 -15 

ccc gcc cta agt tct ctg tgt gtg tcc tgg ggg acc agc agc act gtg 269 

Pro Ala Leu Ser Ser Leu Cys Val Ser Trp Gly Thr Ser Ser Thr Val 

-10 -5 1 

acg agg cta agg ccg tgg ata tcc ccc aca tgg aca tcg agg gcg cgg g 318 
Thr Arg Leu Arg Pro Trp Ile Ser Pro Thr Trp Thr Ser Arg Ala Arg 
5 10 15 



<210> 362 
<211> 360 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 192. .359 



<221> sig_peptide 

<222> 192. .233 

<223> Von Heijne matrix 

score 5.30000019073486 

seq VCIFCFLTSKAFP/NP 

<221> misc_feature 
<222> 277 

<223> n=a, g, c or t 
<400> 362 

tattgggttg ttttcttatt atcaaattgt gaaagttctt tacatattct gggtagaact 60 
cctttatcag atacatgttt tgcaaatgtt ttctaccatt ctctgtctdh tctttctctt 120 
aatactttca cagtttttca tagcagaaat ttataaatta atgaagccca ctttatactt 180 
ttatttcttt t atg gtt tgc atc ttt tgt ttc tta act tcg aaa gct ttt 230 
Met Val Cys Ile Phe Cys Phe Leu Thr Ser Lys Ala Phe 
-10 -5 
cct aac cct aga tca cag gat ttt ctc tta gat ttc tct agg cat tnt 278 
Pro Asn Pro Arg Ser Gln Asp Phe Leu Leu Asp Phe Ser Arg His Xaa 

1 5 10 15 

ata ggt tta ggt ttc aca ttt agg tcc gca atg cat ttt gaa aac ttc 326 
Ile Gly Leu Gly Phe Thr Phe Arg Ser Ala Met His Phe Glu Asn Phe 

20 25 30 

cgt ctg waa ggt ttg ggt caa gat tcc ctt tgt c 360 
Arg Leu Xaa Gly Leu Gly Gln Asp Ser Leu Cys 
35 40 

<210> 363 
<211> 212 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 138. .212 

<221> sig_peptide 

<222> 138. .197 

<223> Von Heijne matrix 

score 5.30000019073486 

seq GFCSVTSSPLASA/GR 

<221> misc_feature 
<222> 152 

<223> n=a, g, c or t 



<400> 363 

cacaaaatca aaaackkagt tgacgtatgc cactttccag ttactattga gatatatatg 60 
cgtgtgtgta tatattacat atatatgtta tatatcatat tkatatattt akaaawttat 120 
atmgavcata catatat atg taw atr tat ktn kkt ava ggg ttt tgc tct 170 
Met Xaa Xaa Tyr Xaa Xaa Xaa Gly Phe Cys Ser 
-20 -15 -10 

gtc aca agc agt cct ctt gcc tca gca ggt agg act aca cgc 212 
Val Thr Ser Ser Pro Leu Ala Ser Ala Gly Arg Thr Thr Arg 
-5 1 5 

<210> 364 
<211> 242 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 127. .240 

<221> sig_peptide 

<222> 127. .195 

<223> Von Heijne matrix 

score 5.30000019073486 

seq LVPCPLLISVALS/VK 

<221> misc_feature 

<222> 71,73 

<223> n=a, g, c or t 
<400> 364 

actttaactt cctggagctc taatttctcc ttctcaggta gaagaatgcc attcactccc 60 
aagtggtgag nnmcaagcag ccagtggtag gaagggtcat caagtcagtt gtcagaaacc 120 
tcactk atg tca ctg twt ahg cta tgt gac cct gac cta gtt cct tgc 168 
Met Ser Leu Xaa Xaa Leu Cys Asp Pro Asp Leu Val Pro Cys 
-20 -15 -10 

cct ctc ttg atc tca gtt gct tta tct gta aaa ttt cac att tkt cag 216 
Pro Leu Leu Ile Ser Val Ala Leu Ser Val Lys Phe His Ile Xaa Gln 

-5 1 5 

caa gtc aac ctt cca tgt tcc tct ca 242 
Gln Val Asn Leu Pro Cys Ser Ser 
10 15 



<210> 365 
<211> 248 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 7. .246 

<221> sig_peptide 

<222> 7. .123 

<223> Von Heijne matrix 

score 5.30000019073486 
seq LXPCLTSFSCXGA/SF 



<400> 365 

tgtaca atg atg atc ctt atc cta att ctt gag cat atc gtc acc kcc 48 
Met Met Ile Leu Ile Leu Ile Leu Glu His Ile Val Thr Xaa 
-35 -30 
aaa aga aac ccc aaa cct gtt aca gtc cct gct ttt ctg csc cct tgc 96 
Lys Arg Asn Pro Lys Pro Val Thr Val Pro Ala Phe Leu Xaa Pro Cys 
-25 -20 -15 -10 

ttg act tct ttc tct tgt kct gga gca tct ttc tct ctk ttw ggt gdg 144 
Leu Thr Ser Phe Ser Cys Xaa Gly Ala Ser Phe Ser Leu Xaa Gly Xaa 

-5 1 5 
aga agg ggt tgg caa cat ggc agc tgc tgc tcc acc att ccc tta ttt 192 
Arg Arg Gly Trp Gln His Gly Ser Cys Cys Ser Thr Ile Pro Leu Phe 

10 15 20 

csa act cta aat tcc ctt ggg cag gga ctc att ggc cca gcc tac ata 240 
Xaa Thr Leu Asn Ser Leu Gly Gln Gly Leu Ile Gly Pro Ala Tyr Ile 

25 30 35 

ggt gcd gg 248 
Gly Ala 
40 



<210> 366 

<211> 351 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 293. .349 



<221> sig_peptide 

<222> 293. .340 

<223> Von Heijne matrix 

score 5.30000019073486 
seq HAISILLCIGASS/QG 
<221> misc_feature 
<222> 36 
<223> n=a, g, c or t 



<400> 366 

aaaacatata tccacacaaa aacttgcaca cataknttca tagcagcatt attcatccaa 60 
aaagtagagg tactcaaatg actttcaact gataaacaca gatgaacaaa atgtatgtcc 120 
aaacagtaga atattattca gctataaaaa agaacagagt acacttagca aactaagaat 180 
agaaggaact tcctcaatct gataaaggac atccatgaaa aacccaccac taatgtcata 240 
cttaatcatg aaaaaccgaa tgcttttctc ctaagatagg aaaaagacaa gt atg tct 298 

Met Ser 
-15 

act cat gcc atc tct att cta ctt tgt att ggt gct tct agc cag ggc 346 
Thr His Ala Ile Ser Ile Leu Leu Cys Ile Gly Ala Ser Ser Gln Gly 

-10 -5 1 

agg gg 351 
Arg 



<210> 367 

<211> 208 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 7. .207 



<221> sig_peptide 
<222> 7. .99 

<223> Von Heijne matrix 

score 5.19999980926514 
seq ATVNAASLPPCFG/VK 



<400> 367 

gtctcg atg gag gag caa gaa acg gaa gag gtc ggg ggg aga agc agc 48
Met Glu Glu Gln Glu Thr Glu Glu Val Gly Gly Arg Ser Ser
-30 -25 -20

cgg aaa aat gca gcc acc gtc aac gcc gcc tcc ctg cca ccg tgc ttc
Arg Lys Asn Ala Ala Thr Val Asn Ala Ala Ser Leu Pro Pro Cys Phe
-15 -10 -5

ggg gta aaa agc tgc cgt tgc cgt cgg tgc agt tgc cgt cgc tgc ctc
Gly Val Lys Ser Cys Arg Cys Arg Arg Cys Ser Cys Arg Arg Cys Leu
1 5 10 15

cta tac ttc tct tgg cct cgg gga agg att tcc cca ccg gtg gga caa
Leu Tyr Phe Ser Trp Pro Arg Gly Arg Ile Ser Pro Pro Val Gly Gln
20 25 30

tgt gcg ggg agg gga t
Cys Ala Gly Arg Gly
35


<210> 368 

<211> 446 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 11. .445 



<221> sig_peptide
<222> 11. .109
<223> Von Heijne matrix

score 5.19999980926514 
seq CCCHAGASSGATA/WE 

<400> 368 

agaatccaag atg cgc ggg atc car gca aar ggg tct ccg ggc cag agt 49
Met Arg Gly Ile Gln Ala Lys Gly Ser Pro Gly Gln Ser
-30 -25

tcg gcc gst gtt ctg wcg cct tgc tgc tgt cac gcg ggc gct tcg tcc 97
Ser Ala Xaa Val Leu Xaa Pro Cys Cys Cys His Ala Gly Ala Ser Ser
-20 -15 -10 -5

aacT acg gag gag acc ccg cgg tcg cgt tgc atc gcc
Gly Ala Thr Ala Glu Glu Thr Pro Arg Ser Arg Cys His Ile Ala
5 10

gtt kcg agt aca aat aca gct tca agg ggc cgc acc tgg tgc aga gcg 193
Val Xaa Ser Thr Asn Thr Ala Ser Arg Gly Arg Thr Trp Cys Arg Ala
15 20 25

acg gga ccg tgc cct tct ggg ccc acg cgg gga gta agc cgg agc aga 241
Thr Gly Pro Cys Pro Ser Gly Pro Thr Arg Gly Val Ser Arg Ser Arg
30 35 40

ggg ctg ggg gcc ggg ttc ctc tcc ccc ttc tgc tgc ctc ttc gcc ttt 289
Gly Leu Gly Ala Gly Phe Leu Ser Pro Phe Cys Cys Leu Phe Ala Phe
45 50 55 60

cat ccg cgg cta ccc tgg tgt gct gag gtt ccc gtt cca gca gct gca 337
His Pro Arg Leu Pro Trp Cys Ala Glu Val Pro Val Pro Ala Ala Ala
65 70 75

cac cat atg cgc tgt gga ggg gac ctc ctg gca gcc cct ccg ccg ggt 385
His His Met Arg Cys Gly Gly Asp Leu Leu Ala Ala Pro Pro Pro Gly
80 85 90

ccc tcc tgg ttc gca cgg ttc cct ccg ctt gtc ccc gag tct ttc cct 433
Pro Ser Trp Phe Ala Arg Phe Pro Pro Leu Val Pro Glu Ser Phe Pro
95 100 105

cac cat tct gtt c 446
His His Ser Val
110



<210> 369 
<211> 125 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22. .123 

<221> sig_peptide 

<222> 22 . . 93 

<223> Von Heijne matrix 

score 5.19999980926514 

seq LIWVFGLVSVLSX/FL 

<400> 369 

ctatcaagag gettteeece t atg ttt tct tct agg agt ttt atg gtt tea 51 

Met Phe Ser Ser Arg Ser Phe Met Val Ser 
-20 -15 
ggt ett att tgg gtc ttt ggt ctt gta tct gtt ttg agt bga ttt ttg 99 
Gly Leu He Trp Val Phe Gly Leu Val Ser Val Leu Ser Xaa Phe Leu 

-10 -5 1 

tgt atg gtg tat gat cag ggt cag gg 125 
Cys Met Val Tyr Asp Gin Gly Gin 
5 10 



<210> 370 
<211> 132
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .131 

<221> sig_peptide

<222> 39. .77 

<223> Von Heijne matrix 

score 5.19999980926514 

seq MLLAVSLSLVSNC/NF

<400> 370 

atcttagagg aaagtctttc agtttttccc cattcagt atg tta tta get gtg age 56 

Met Leu Leu Ala Val Ser 
-10 

ctg tec ctt gtc tct aat tgt aac ttt gta etc act gac eaa ett tte 104 
Leu Ser Leu Val Ser Asn Cys Asn Phe Val Leu Thr Asp Gin Leu Phe 

-5 1 5
cct gcc cct gcs tec etc ate eee gaa g 132 
Pro Ala Pro Ala Ser Leu lie Pro Glu 
10 15 

<210> 371 
<211> 127 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 4. .126 

<221> sig_peptide 
<222> 4 . .90 

<223> Von Heijne matrix 

score 5.19999980926514 
seq TGVFLFSIIGSFG/FP 

<400> 371 

tga atg aac caa gat ttc aac cca gaa att gag get tea cca caa gtg 48 
Met Asn Gin Asp Phe Asn Pro Glu lie Glu Ala Ser Pro Gin Val 
-25 -20 -15 

aag act ggg gtt ttc ttg ttt tea att att ggg agt ttt gga ttt cca 96 
Lys Thr Gly Val Phe Leu Phe Ser lie lie Gly Ser Phe Gly Phe Pro 

-10 -5 1 

gga atg tgc aat tgt aaa aac cca gcc egg g 127 
Gly Met Cys Asn Cys Lys Asn Pro Ala Arg 
5 10 

<210> 372 
<211> 196 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 125. .196 

<221> sig_peptide 

<222> 125. .184 

<223> Von Heijne matrix 
score 5.19999980926514
seq IVSSLFSWLLSLT/SV 



<221> misc_feature
<222> 119 

<223> n=a, g, c or t 



<400> 372 

taaaaatctt ttatgttcta cccactcctt cctcgttccc tctccccact cctccctccc 60 
cccatcttaa gcccatggca acccctgatc tttttactgt ctccatcgtt ttgccttbnc 120 
caga atg cca tgt agt tgg agt cat ata gta agt age ctt ttc agt tgg 169 

Met Pro Cys Ser Trp Ser His lie Val Ser Ser Leu Phe Ser Trp 

-20 -15 -10 

ctt ctt tea ctt acc agt gtg ccc ggg 196 
Leu Leu Ser Leu Thr Ser Val Pro Gly 
-5 1 



<210> 373 

<211> 148 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 56. .148 



<221> sig_peptide 

<222> 56. .139 

<223> Von Heijne matrix 

score 5.19999980926514 

seq PVLSCCCLTAGRA/RL 

<400> 373 

actttcttca cacccaggac gcagggtgcc gctgccggcc acagaaaccc caaga atg 58 

Met 

ttt ttc ttt ggc tat tea gag gac ate tat tgt gtg tea ggc cct gtg 106 
Phe Phe Phe Gly Tyr Ser Glu Asp lie Tyr Cys Val Ser Gly Pro Val 

-25 -20 -15 

ctg age tgt tgt tgc ctg aca gca gga aga gcg egg etc tgg 148 
Leu Ser Cys Cys Cys Leu Thr Ala Gly Arg Ala Arg Leu Trp 
-10 -5 1 



<210> 374 
<211> 200 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 26. .199 



<221> sig_peptide 

<222> 26. .73 

<223> Von Heijne matrix 

score 5.19999980926514 
seq AALICPWSSQVPS/SP 



<400> 374 

ctagggagga ctcaatgctc tttgt atg cct tat gca gcg ctg ate tgt ccc 52 

Met Pro Tyr Ala Ala Leu lie Cys Pro 
-15 -10 
tgg agt tcc cag gtt ccc agc tcc ccc cct gca agc ctt gaa gcc tcc 100
Trp Ser Ser Gln Val Pro Ser Ser Pro Pro Ala Ser Leu Glu Ala Ser

-5 1 5
age aae gtc tat etc cag gag age agg gca gcc tat gca agt gtt ccg 148 
Ser Asn Val Tyr Leu Gin Glu Ser Arg Ala Ala Tyr Ala Ser Val Pro 
10 15 20 25 

gca gga cca gaa gtg gee act caa cac acg tec tea eca gtc ace cct 196 
Ala Gly Pro Glu Val Ala Thr Gin His Thr Ser Ser Pro Val Thr Pro 

30 35 40 

atg g 200 
Met 



<210> 375 

<211> 112 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 52. .111 



<221> sig_peptide 

<222> 52. .105 

<223> Von Heijne matrix 

score 5.19999980926514 
seq LTYSLAFLLFIKA/GT 



<400> 375 

aataaccctt tcacagcact tgcctgtttt taatgaatct aattatteae a atg caa 57 

Met Gin 

ctt tta tat tta aca tac tct tta get ttc ctg eta ttt ate aag get 105 
Leu Leu Tyr Leu Thr Tyr Ser Leu Ala Phe Leu Leu Phe lie Lys Ala 

-15 -10 -5 

gge ace g 112 
Gly Thr 
1 



<210> 376 
<211> 146 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 74 . .145 



<221> sig_peptide 

<222> 74 . .133 

<223> Von Heijne matrix 

score 5.19999980926514 
seq AAAVTSSAAPSRA/RQ 



<400> 376 

ggctggagcg egegectect ageggasegg ggeaattgga aggccgcgec tcaggaaaac 60 
aggatggtag tga atg gca eeg age egc eee agg get gee gee gtc aec 109 
Met Ala Pro Ser Arg Pro Arg Ala Ala Ala Val Thr 
-20 -15 -10 

tec teg geg get eeg agt egt gcg agg eag ggg gee e 146 
Ser Ser Ala Ala Pro Ser Arg Ala Arg Gin Gly Ala 
-5 1 



<210> 377 
<211> 389 
<212> DNA 
<220>
<221> CDS 
<222> 218. .388 

<221> sig_peptide 

<222> 218. .343 

<223> Von Heijne matrix 

score 5.19999980926514 
seq QHLLSWAXQXGRX/QV 

<221> misc_feature 
<222> 139 

<223> n=a, g, c or t 



<400> 377 

cattttgtcg gtagaggcag aaggwgaagg tcgggttgta gaagctgggg tggccggcag 60 
ctcgctcatc ggtgttcgtg ggctttgtcg gtccgtgcct cgtctctccc tggaaaggga 120 
gggaggcttc gacgtcgrnr aggragmmgc tgccgcgtta gttccgagct tgaagtcact 180 
aggacttctc tcaaacttgt gtgctgagga gactcag atg ttg gcc tca gct cct 235
Met Leu Ala Ser Ala Pro
-40

agg ctg aac tca gca gat cgg ccc atg aaa act tct gta ttg aga caa 283
Arg Leu Asn Ser Ala Asp Arg Pro Met Lys Thr Ser Val Leu Arg Gln
-35 -30 -25

agg aag gga tct gtc aga aag caa cac ttg tta tct tgg gct tdg cag 331
Arg Lys Gly Ser Val Arg Lys Gln His Leu Leu Ser Trp Ala Xaa Gln
-20 -15 -10 -5

yaa ggh aga kga cag gta gtg gag atc ctg caa tct gaa aag cag act 379
Xaa Gly Arg Xaa Gln Val Val Glu Ile Leu Gln Ser Glu Lys Gln Thr
1 5 10

daa rgt gac g 389
Xaa Xaa Asp
15


<210> 378 
<211> 143 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 2. .142 

<221> sig_peptide

<222> 2. .115 

<223> Von Heijne matrix 

score 5.19999980926514 

seq LHGSLDAVSQAQG/RP 

<400> 378 

a atg tac ccc cta ggc agg gga gag cag ggc cct gct gca ccc aag tcc 49
Met Tyr Pro Leu Gly Arg Gly Glu Gin Gly Pro Ala Ala Pro Lys Ser 
-35 -30 -25 

tgg ttg etc etc ccc acc aca etg gee etc eat gga age ett gat gca 97 
Trp Leu Leu Leu Pro Thr Thr Leu Ala Leu His Gly Ser Leu Asp Ala 

-20 -15 -10

gtg age cag gee caa gga egc ccc ggc cac cct gac gca ccc ccc a 143 
Val Ser Gin Ala Gin Gly Arg Pro Gly His Pro Asp Ala Pro Pro 
-5 1 5

<210> 379
<211> 261 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 198. .260 

<221> sig_peptide 

<222> 198. .245 

<223> Von Heijne matrix 

score 5.19999980926514 

seq FIAALFTIAETWN/QP 

<400> 379 

cagatggtgg tgaggttgta gagaaaaagg aacgcttata cactgttggt gcgagtgtaa 60 
attagtttaa ccattgtgga agatgatatg gcaattccac aaagacctaa agtcagraat 120 
tmcmattcaa cccagtaatc ccattactgg gtatatactc aaaggaatat aaattgttgt 180 
gttacaaaga cacatgc atg cgt gtg ttc att gca gca ctg ttc aca ata 230 
Met Arg Val Phe lie Ala Ala Leu Phe Thr lie 
-15 -10 
gca gag aca tgg aat caa ccc aaa tgc cca g 261 
Ala Glu Thr Trp Asn Gin Pro Lys Cys Pro 
-5 1 5

<210> 380 
<211> 228 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63. .227 

<221> sig_peptide

<222> 63. .152 

<223> Von Heijne matrix 

score 5.19999980926514 

seq LCFLSVHFRLRWG/DS 

<400> 380 

gggacgtggg aaaatgacta cgcgtcactc gtgatgtcgc gcatccgata ggcccttttc 60 
ag atg gca aaa ggc ctg agg gtg aat ctg ggc gag ctg gtt gag tcc 107
Met Ala Lys Gly Leu Arg Val Asn Leu Gly Glu Leu Val Glu Ser
-30 -25 -20

atg cgt ttg tgc ttc ctc tca gtc cac ttt cgc tta cga tgg ggc gac 155
Met Arg Leu Cys Phe Leu Ser Val His Phe Arg Leu Arg Trp Gly Asp
-15 -10 -5 1

tct tgt cca tcg tca cct cac cgg gaa act ttt cct gcc ggg cca gtt 203
Ser Cys Pro Ser Ser Pro His Arg Glu Thr Phe Pro Ala Gly Pro Val
5 10 15

aat ggt ccc ctg tac cac ccc cgg g 228
Asn Gly Pro Leu Tyr His Pro Arg
20 25

WO 99/53051 PCT/IB99/00712

<210> 381
<211> 300
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 39. .299

<221> sig_peptide

<222> 39. .89 

<223> Von Heijne matrix 

score 5.09999990463257 

seq QLLVLFGSQTGTA/QD 

<400> 381 

agtttttagt ctcagaccag accaccgggc gcgccccg atg ccg age ccg cag ctt 56 

Met Pro Ser Pro Gin Leu 
-15 

ctg gtg etc ttc ggc age cag aca ggc acg get cag gat gtg teg gag 104 
Leu Val Leu Phe Gly Ser Gin Thr Gly Thr Ala Gin Asp Val Ser Glu 

-10 -5 1 5

aga ctg ggt cgc gag gcc egg ggc egg egg ctt ggc tgc egg gtg cag 152 
Arg Leu Gly Arg Glu Ala Arg Gly Arg Arg Leu Gly Cys Arg Val Gin 

10 15 20 

gcc ctg gac tec tac ccg gtg gtg aat ctg att aac gag ccc ctg gtg 200 
Ala Leu Asp Ser Tyr Pro Val Val Asn Leu He Asn Glu Pro Leu Val 

25 30 35 

ata ttt gtt tgt gea act ayw ggc caa gga gac eee cet gae aac atg 248 
He Phe Val Cys Ala Thr Xaa Gly Gin Gly Asp Pro Pro Asp Asn Met 

40 45 50 

aag aac ttc tgg agg ttt ata ttc egg aag aac ctg ccc tec ace gee 296 
Lys Asn Phe Trp Arg Phe He Phe Arg Lys Asn Leu Pro Ser Thr Ala 

55 60 65 

cgg g 300
Arg
70

<210> 382
<211> 151
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 8. .151

<221> sig_peptide
<222> 8. .130
<223> Von Heijne matrix
score 5.09999990463257
seq SFLFLACIFQGXS/XX

<400> 382

atacata atg tct tcc att ttg ggt gtc tca tcc tca tgg tgg tat tta 49
Met Ser Ser Ile Leu Gly Val Ser Ser Ser Trp Trp Tyr Leu
-40 -35 -30

tat tat ggc tat tgt ata ttt gtt aaa aag tgc tct ttt tgc agt ttc 97
Tyr Tyr Gly Tyr Cys Ile Phe Val Lys Lys Cys Ser Phe Cys Ser Phe
-25 -20 -15

ctg ttc ctt gcc tgt att ttt caa ggc tkt tck ckt kat wca aac aca 145
Leu Phe Leu Ala Cys Ile Phe Gln Gly Xaa Ser Xaa Xaa Xaa Asn Thr
-10 -5 1 5

caa agc 151
Gln Ser



<210> 383 
<211> 255 
<212> DNA 
<213> Homo sapiens 




<220> 
<221> CDS 
<222> 101. .253 

<221> sig_peptide

<222> 101. .184 

<223> Von Heijne matrix 

score 5.09999990463257 

seq CLCGSAPCLLCRC/CP 

<400> 383 

gcgtccggaa gtgtctcgca gatagtaaat aatctcggaa aggcgagaaa gaagctgtct 60 
ccatcttgtc tgtatccgct gcwcttgtga cgttgtggag atg ggg agc gtc ctg 115
Met Gly Ser Val Leu
-25

ggg ctg tgc tcc atg gcg agc tgg ata cca tgt ttg tgt gga agt gcc 163
Gly Leu Cys Ser Met Ala Ser Trp Ile Pro Cys Leu Cys Gly Ser Ala
-20 -15 -10

ccg tgt ttg cta tgc cga tgc tgt cct agt gga aac aac tcc act gta 211
Pro Cys Leu Leu Cys Arg Cys Cys Pro Ser Gly Asn Asn Ser Thr Val
-5 1 5

act aga ttg atc tat gca ctt ttc ttg ctt gtt gga gta tgg gg 255
Thr Arg Leu Ile Tyr Ala Leu Phe Leu Leu Val Gly Val Trp
10 15 20

<210> 384 
<211> 456 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128. .454 

<221> sig_peptide 

<222> 128 . .265 

<223> Von Heijne matrix 

score 5.09999990463257 

seq IGSCSVMSSGALC/VP 

<400> 384 

tacaactttg aaaagecctt ectctggctt gctagacagc tcattggaga ccgtaacttg 60 
gaatttgttg ccatgeetgc tcttgcctca ccagagattg tcatggaccc aaatttggca 120 



gtgtagt atg agc wkr wtt agm agg ttg stt aga caa ctg ctc tcc cag 169
Met Ser Xaa Xaa Xaa Arg Leu Xaa Arg Gln Leu Leu Ser Gln
-45 -40 -35

rtg agg rwg atg acc tgt gag aat gaa gct gga gcc cag tgt car aag 217
Xaa Arg Xaa Met Thr Cys Glu Asn Glu Ala Gly Ala Gln Cys Gln Lys
-30 -25 -20

tct agt ttt ata ggc agc tgt tct gtg atg tca agt ggt gca ctg tgt 265
Ser Ser Phe Ile Gly Ser Cys Ser Val Met Ser Ser Gly Ala Leu Cys
-15 -10 -5

gtg cca ctt tat tat cta gct aag ggc aac atg tgc tcc atc tgt ggg 313
Val Pro Leu Tyr Tyr Leu Ala Lys Gly Asn Met Cys Ser Ile Cys Gly
1 5 10 15

atg ctg aag gag atg aat ggg ctt tgg agt gaa tgt gac agt tta aaa 361
Met Leu Lys Glu Met Asn Gly Leu Trp Ser Glu Cys Asp Ser Leu Lys
20 25 30

aat acc ttc att gtt tgg rcc tgc ata ttt agc tgt ttg gga atg caa 409
Asn Thr Phe Ile Val Trp Xaa Cys Ile Phe Ser Cys Leu Gly Met Gln
35 40 45

ttg awt tct tct kgr gtt tca aat gta aga ctg cta ctg tca cat ca 456
Leu Xaa Ser Ser Xaa Val Ser Asn Val Arg Leu Leu Leu Ser His
50 55 60



<210> 385 
<211> 193 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 1. .192 

<221> sig_peptide 
<222> 1. .78 

<223> Von Heijne matrix 

score 5.09999990463257 
seq AFPFVCLTFCVGG/GP 

<400> 385 

atg cct cat cca ctg get acc tct gcg ttt ctg cgt tec gcc ttt cct 48 
Met Pro His Pro Leu Ala Thr Ser Ala Phe Leu Arg Ser Ala Phe Pro 

-25 -20 -15 

ttt gtt tgt etc acg ttt tgc gtg gga ggc ggt cec ggg att tea ggg 96 
Phe Val Cys Leu Thr Phe Cys Val Gly Gly Gly Pro Gly lie Ser Gly 
-10 -5 1 5

gtc tac egg etc ctt atg gcg aat gca acc cga aga gag agt gag gta 144 
Val Tyr Arg Leu Leu Met Ala Asn Ala Thr Arg Arg Glu Ser Glu Val 

10 15 20 

age etc ege ggg ttg ggc agg gae gga gag ggg gee ege gcg act cca g 193 
Ser Leu Arg Gly Leu Gly Arg Asp Gly Glu Gly Ala Arg Ala Thr Pro 
25 30 35 



<210> 386 
<211> 281 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 199. .279 



<221> sig_peptide 

<222> 199. .267 

<223> Von Heijne matrix 

score 5.09999990463257 
seq SLMVFLNLFFLNC/DP 



<400> 386 

tgtttatagg ttttaaetet tatggttaga atggttgtga gteataegwg tgtcagaect 60 
ctgctaattt cctcaggaca eatteecaga agtggaatta ecaagtcaaa gagcataaat 120 
actttagaga tacatgataa attgtgeeag etaeetttec aaaagagttg tactagttga 180 
ggtttctgcc agcagtat atg aca gtt ggg etc cat att tta aga gat tea 231 

Met Thr Val Gly Leu His lie Leu Arg Asp Ser 
-20 -15 
eta atg gtg ttt etc aac ctt ttt ttt tta aac tgt gae cca cac agg 279 
Leu Met Val Phe Leu Asn Leu Phe Phe Leu Asn Cys Asp Pro His Arg 
-10 -5 1 

gg 281



<210> 387 
<211> 111 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 5. .109 

<221> sig_peptide 
<222> 5. .67 

<223> Von Heijne matrix 

score 5.09999990463257 
seq MFCVSLLLHHAYP/LP 

<400> 387 

cacc atg gta aga tgg gga cat ccc cct atg ttc tgt gtc tct ctc ctg 49

Met Val Arg Trp Gly His Pro Pro Met Phe Cys Val Ser Leu Leu
-20 -15 -10

ctc cac cat gct tat cct ttg cct tcc acc atg att gta agt ttc cca 97
Leu His His Ala Tyr Pro Leu Pro Ser Thr Met Ile Val Ser Phe Pro

-5 1 5 10

agg cct ccc ctg gg 111
Arg Pro Pro Leu

<210> 388 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 96. .374 

<221> sig_peptide

<222> 96. .173 

<223> Von Heijne matrix 

score 5.09999990463257 

seq AMVCFGCPGGASS/RC 

<221> misc_feature 
<222> 344 

<223> n=a, g, c or t 



<400> 388 

ttttggccgc catgttttcg tcgeagtaac tgccttggtg tcagtagtca ttgccagttt 60 
cgggcgttct ggacaattgg gatgctgcag agttc atg gct ggg gct gct cgt 113
Met Ala Gly Ala Ala Arg
-25

tgg gtg gga caa kaa tcc tct gca atg gtt tgt ttt ggc tgc cca gga 161
Trp Val Gly Gln Xaa Ser Ser Ala Met Val Cys Phe Gly Cys Pro Gly
-20 -15 -10 -5

ggt gcg tca agt cgc tgc cgc tcc cct cgt ggg cgt cag gcc tca aga 209
Gly Ala Ser Ser Arg Cys Arg Ser Pro Arg Gly Arg Gln Ala Ser Arg
1 5 10

gtt ccc cgc cta gaa aat gga gct cag cga gtc gtg cgt acc atg gtg 257
Val Pro Arg Leu Glu Asn Gly Ala Gln Arg Val Val Arg Thr Met Val
15 20 25

cac ctg gtt ttg cag cct aag cgg gtc act tta gtg cat cct cct cgc 305
His Leu Val Leu Gln Pro Lys Arg Val Thr Leu Val His Pro Pro Arg
30 35 40

gga ttg gag cct gtt tgc acc cct ata gcm vga atg am ccc aag tca 353
Gly Leu Glu Pro Val Cys Thr Pro Ile Ala Xaa Met Xaa Pro Lys Ser
45 50 55 60

cac ggg ctc aga agt tct ttg 374
His Gly Leu Arg Ser Ser Leu
65

<210> 389 
<211> 192 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 52. .192 

<221> sig_peptide

<222> 52. .153 

<223> Von Heijne matrix 

score 5.09999990463257 

seq PXLLSXLHGLLYG/SP 

<400> 389 

ggcagacttc aaccaggctg tgggaggaga gctcagtggg gcacagagaa g atg ggt 57 

Met Gly 

gtt gtc agt ggg ggt gtt ggt gac ttg acc aca aaa acc caa gag aat 105 
Val Val Ser Gly Gly Val Gly Asp Leu Thr Thr Lys Thr Gin Glu Asn 

-30 -25 -20 

ggg etc tta cca gvc cty etc tec wkc ctk cac gga ctg etc tat ggc 153 
Gly Leu Leu Pro Xaa Leu Leu Ser Xaa Leu His Gly Leu Leu Tyr Gly 

-15 -10 -5 

age cct gat gea gar etc acg ggc ccg gat cec tgg gat 192 
Ser Pro Asp Ala Glu Leu Thr Gly Pro Asp Pro Trp Asp 
1 5 10

<210> 390 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 321. .371 

<221> sig_peptide 

<222> 321. .365 

<223> Von Heijne matrix 

score 5.09999990463257 

seq FLSXTCVLSCXRS/LS 

<400> 390 

tctgttcagg ttttgtattt gttcatagta taatcttggt ttgtaggttg tgtgtatctg 60 
ggaagaaact ttacaatetc taacaggcct ggaaggtcta atctataaaa gtatttcatt 120 
gaccttgaag aaggtcaatt atttatataa gaaaataaac tcaacatttt atccataaaa 180 
aatgtaattc cggaatttat gttagtaaaa ttataacact gataacataa aaagtgctat 240 
taatccttaa gaaagagtta ccttttcttt tctatcttca tcacagctag cccagtctta 300 
gtetatttca ttagcttcct atg ggc ttc etc tea ckt aca tgc gtg etc tet 353 

Met Gly Phe Leu Ser Xaa Thr Cys Val Leu Ser 
-15 -10 -5 

tgc dtg egc teg etc tct 371 
Cys Xaa Arg Ser Leu Ser 
1 

<210> 391 
<211> 328 
<212> DNA 
<213> Homo sapiens 




<220> 
<221> CDS 
<222> 184. .327 

<221> sig_peptide
<222> 184. .300
<223> Von Heijne matrix 
score 5 

seq LVCFFNSVSFLFG/VS 
<400> 391 

ccgttatgtg ttcagctcaa ttagattaat taccttcctc accaggagtc acaatgcttt 60 
gcagtttatc tgcggtaact aaatgttagt tttgtaagta aaaggtactg ttattgacct 120 
cgaaagggct atagttcctt tgaacttaca gagaagagtt ccaaacaact atttctaacc 180 
aag atg gaa tat ggg tea gca aaa ttg tct tea ggt aga gtt ttc tac 228 
Met Glu Tyr Gly Ser Ala Lys Leu Ser Ser Gly Arg Val Phe Tyr 
-35 -30 -25 

ttg cca aga gac ttt ggc att gag agg aga gtt ctt gtt tgt ttt ttt 276 
Leu Pro Arg Asp Phe Gly lie Glu Arg Arg Val Leu Val Cys Phe Phe 

-20 -15 -10 

aac tct gta tea ttt ctg ttt ggt gte tct ara aaa aaa tec gra caa 324 
Asn Ser Val Ser Phe Leu Phe Gly Val Ser Xaa Lys Lys Ser Xaa Gin 

-5 1 5
tgg g 328 
Trp 

<210> 392 
<211> 303 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 252. .302 

<221> sig_peptide 
<222> 252. .290 
<223> Von Heijne matrix 
score 5 

seq MLSGLVLNSWALA/YQ 
<400> 392 

tgaccttgta gcagttatct ttgttaaact ccttcatttc ttattttaaa taattaatta 60 
attaatttag agacagggtc teactatgtc acceaggctg tagtgcagtg gtgcaatcat 120 
ggctcactgt agccttgacc tcccaggctc aagcaatctt cctacctcag cetctcaggc 180 
agctgggact acagacccac agcactacgc ctgacttatg attttatttt ttgtggagac 240 
agggtcttac t atg ttg tct ggg ctt gtc tta aac tct tgg gcc tta gcc 290 
Met Leu Ser Gly Leu Val Leu Asn Ser Trp Ala Leu Ala 
-10 -5 
tac caa eta get g 303 
Tyr Gin Leu Ala 
1 

<210> 393 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 298. .366 



<221> sig_peptide
<222> 298. .345
<223> Von Heijne matrix 
score 5 

seq VFFXGXSIILVLG/ST 

<221> misc_feature
<222> 265 

<223> n=a, c or t 



<400> 393 

tttttccccg cccctgagac cctgcagcac catctgtcat ggcggctggg ctgtttggtt 60 
tgagcgctcg ccgtcttttg gcggcagcgg cgacgcgagg gctcccggcc gcccgcgtcc 120 
gctgggaatc tagcttctcc argamytgtg gtcgccccgt ccgctgtggc gggaaagcgg 180 
tccccagaac cgaccacacc gtggcaagag gacccagaac ccgaggacga aaacttgtat 240 
gagaagaasc cagactccca tggknatgac aaggaccccg ttttggacgt ctggaac 297 
atg cga ctt gtc ttc ttc ktw ggc gks tcc atc atc ctg gtc ctt ggc 345

Met Arg Leu Val Phe Phe Xaa Gly Xaa Ser Ile Ile Leu Val Leu Gly

-15 -10 -5 

age ace ttt gkg gee tat ctg 366 
Ser Thr Phe Xaa Ala Tyr Leu 
1 5 



<210> 394 

<211> 126 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 21. .125 



<221> sig_peptide
<222> 21. .68 
<223> Von Heijne matrix 
score 5 

seq SDFFLLFVSLSLS/PF 



<400> 394 

agcttggcat ataggctcaa atg tta tea tea gat ttt ttt etc etc ttt gtc 53 

Met Leu Ser Ser Asp Phe Phe Leu Leu Phe Val 
-15 -10 
tct tta tet tta tet cca ttt cct ttt ttt ctt ttt cct ccc etc ttt 101 
Ser Leu Ser Leu Ser Pro Phe Pro Phe Phe Leu Phe Pro Pro Leu Phe 
-5 1 5 10

tec tgc ttt etc tta ccc ace egg g 126 
Ser Cys Phe Leu Leu Pro Thr Arg 
15 



<210> 395 

<211> 329 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 154 . .327 



<221> sig_peptide
<222> 154. .195 
<223> Von Heijne matrix 
score 5 

seq FIAALFTVAKIWN/QP 



<400> 395

tgaaaatgta aattagtgca gttattatgg magtcagtat ggaacttcct caaaaaacta 60 
acaataaaac tcccatatga tccagcaatc ctaccactgr atatttatcc aaaggaaagg 120
aagtcggtat atttaacagg catctgcacc ccc atg ttt att gca gca eta ttc 174 

Met Phe He Ala Ala Leu Phe 
-10 

aca gta gee aag ata tgg aat caa cct aaa tgt cca tea acg gat gaa 222 
Thr Val Ala Lys He Trp Asn Gin Pro Lys Cys Pro Ser Thr Asp Glu 

-5 1 5
tgg ata aat aaa atg tgg tac ata tac aca atg gag tac tat cca gae 270 
Trp He Asn Lys Met Trp Tyr He Tyr Thr Met Glu Tyr Tyr Pro Asp 
10 15 20 25 

ata aaa aag aat gga att ctg aca ttt aag gca aca agg atg aac egg 318 
He Lys Lys Asn Gly He Leu Thr Phe Lys Ala Thr Arg Met Asn Arg 

30 35 40 

aag aca tta tg 329 
Lys Thr Leu 

<210> 396 

<211> 99 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 5. .97 

<221> sig_peptide 
<222> 5 . .49 

<223> Von Heijne matrix 
score 5 

seq VCGCLCVWMCVCG/XV 

<221> misc_feature
<222> 49 

<223> n=a, g, c or t 



<400> 396 

gtat atg tgt gtg tgt ggg tgt tta tgt gtg tgg atg tgt gtg tgt ggn 49 
Met Cys Val Cys Gly Cys Leu Cys Val Trp Met Cys Val Cys Gly 
-15 -10 -5 

wtt gtg tgt ata tac ata tgm gtg tat gtg tgt aca tgt gtg agg ggg 97 
Xaa Val Cys He Tyr He Xaa Val Tyr Val Cys Thr Cys Val Arg Gly 



1 5

ga

<210> 397
<211> 316
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> 134. .316

<221> sig_peptide
<222> 134. .211
<223> Von Heijne matrix
score 5

seq LSLFVFFWLVGFS/FF

<221> misc_feature
<222> 284. .285 
<223> n=a, g, c or t 



<400> 397 

ttgtgaactc ttctattatt attaagtgtt gtcaattgtc agcatccata ttctattccg 60 
atgatgaata gaagcattat atttcagcat caaaatgcag ttggggtcgt aatgagcatc 120 
attagggacc tta atg gga gtc aga act gta tgt cat ttt att cag gtt 169 
Met Gly Val Arg Thr Val Cys His Phe He Gin Val 
-25 -20 -15 

ttt eta agt tta ttt gtg ttt ttt tgg tta gtt ggt ttt tct ttt ttc 217 
Phe Leu Ser Leu Phe Val Phe Phe Trp Leu Val Gly Phe Ser Phe Phe 

-10 -5 1 

ttt ttt tta cdb ttt tct acc aag cag gtg aga gtw gaa cag cat tgt 265 
Phe Phe Leu Xaa Phe Ser Thr Lys Gin Val Arg Val Glu Gin His Cys 

5 10 15 

gat ttt aaa agt aca cca nnd gta gag tct tec agt acc gtt ggc cat 313 
Asp Phe Lys Ser Thr Pro Xaa Val Glu Ser Ser Ser Thr Val Gly His 

20 25 30 

gcc 316 
Ala 
35 



<210> 398 
<211> 251 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 63. .251 



<221> sig_peptide 
<222> 63 . . 143 
<223> Von Heijne matrix 
score 5 

seq LSCFYLLAIVSNA/VM 



<400> 398 

atgttgtagc ttctgtcata atttccttcc cttttaaggc tgaataattt tccattgtgt 60 
at atg tac cat att ttg ttc ate cat tea ttc att gat aga tac ttg 107 
Met Tyr His He Leu Phe He His Ser Phe He Asp Arg Tyr Leu 
-25 -20 -15 

agt tgc ttc tac ctt ttg gca att gtg agt aat get gtt atg aac atg 155 
Ser Cys Phe Tyr Leu Leu Ala He Val Ser Asn Ala Val Met Asn Met 

-10 -5 1 

ggt gta caa atg tct gtt ttg agt cct tgt ttt get ttc gtg cat tct 203 
Gly Val Gin Met Ser Val Leu Ser Pro Cys Phe Ala Phe Val His Ser 
5 10 15 20 

att aaa aat gtt aag gtt ctt tgc ttt tta ctt ttt ttt etc ttt ggg 251 
He Lys Asn Val Lys Val Leu Cys Phe Leu Leu Phe Phe Leu Phe Gly 
25 30 35 



<210> 399 
<211> 120 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 10. .120 




<221> sig_peptide 
<222> 10. .75 
<223> Von Heijne matrix 
score 5 

seq VQWLLVYSPSCAA/TI 
<400> 399 

tcatttacc atg cag ttc acc gtt tta atg tgt cca gtt cag tgg ttg tta 51 
Met Gin Phe Thr Val Leu Met Cys Pro Val Gin Trp Leu Leu 
-20 -15 -10 

gtg tat tea ccc agt tgt gca gcc acc ate aca gtc aat ttt aaa aca 99 
Val Tyr Ser Pro Ser Cys Ala Ala Thr lie Thr Val Asn Phe Lys Thr 

-5 1 5
ttt tea tea ccc caa ace ggg 120 
Phe Ser Ser Pro Gin Thr Gly 
10 15 

<210> 400 
<211> 463 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 342. .461 

<221> sig_peptide 
<222> 342. .452 
<223> Von Heijne matrix 
score 5 

seq VSCLSAGLRVCCS/QR 

<221> misc_feature 

<222> 246,260 

<223> n=a, g, c or t 



<400> 400 

ctctgtcccc gcggctgggt ctcgtctgct ceggttcctg ggctcctaat tettggtcca 60 
gcttcttcca ggcacatcct cttctctgcc ctccgtccat tttggagccg gagatggtgg 120 
gctkggggcc gccccagtag tgagacagtg gaagtaaacc ccatctgccg ttcccgtgcg 180 
tagagaaaaa cgttgaccgc gaggetgggg aggagagttg cctetgagga agaagggcac 240 
agaganccaa aattagtttn gaaagcatec tgatttggtg eccgaggcct ggaaagaaat 300 
ggcggctggg gtgcggcgga ggtaggggag ga^aacgttg g atg aga agg gcc tgg 356 

Met Arg Arg Ala Trp 
-35 

act cag gaa agg gaa ccg cgt ccg tgt gag ccc get gag cgc gca gac 404 
Thr Gin Glu Arg Glu Pro Arg Pro Cys Glu Pro Ala Glu Arg Ala Asp 

-30 -25 -20 

cct gcc cct gtc tec tgt ctg tet gca ggt ctg cgc gtc tgt tgt tec 452 
Pro Ala Pro Val Ser Cys Leu Ser Ala Gly Leu Arg Val Cys Cys Ser 

-15 -10 -5 

cag cgc tet gc 463 
Gin Arg Ser 
1 

<210> 401 
<211> 206 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 94. .204

<221> sig_peptide 

<222> 94. .168 

<223> Von Heijne matrix 

score 4.90000009536743 

seq DFFICLLAICVSS/FE 

<400> 401 

tactgtttat tgattctttg attatggcca ttcttacagg agtaaggtgg tatcacactg 60 
tggttttgat ttgcatttcc ctgatcatta gtg atg ttg cat ttg att tgc att 114 

Met Leu His Leu lie Cys lie 
-25 -20 
tec ctg ate gtt aat gat ttt ttc ata tgt ttg ttg gee att tgc gta 162 
Ser Leu He Val Asn Asp Phe Phe He Cys Leu Leu Ala He Cys Val 

-15 -10 -5 

tct tct ttt gag aat tgt eta ttt atg tec tta gee cac agt gg 206 
Ser Ser Phe Glu Asn Cys Leu Phe Met Ser Leu Ala His Ser 
1 5 10

<210> 402 
<211> 330 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 42. .329 

<221> sig_peptide

<222> 42. .230 

<223> Von Heijne matrix 

score 4.90000009536743 

seq VTSL7^LIPPVKA/XP 

<400> 402 

acagggctcc actgcagtta ggagccggtg agtccgggtg g atg agg tca gag cgc 56 

Met Arg Ser Glu Arg 
-60 

ccc atg gtg tgg tgc tgc ctc ttt gtc cgt tcg cag cga aaa cgg aaa 104 
Pro Met Val Trp Cys Cys Leu Phe Val Arg Ser Gln Arg Lys Arg Lys 

-55 -50 -45 

cag agc acc caa gat gaa gat gct gtt agc ctt tgc agt ctc gac ata 152 
Gln Ser Thr Gln Asp Glu Asp Ala Val Ser Leu Cys Ser Leu Asp Ile 

-40 -35 -30 

agt gag cct agt aat aaa cgg gtc aaa ccc ctt tcc cga gtc acg tcg 200 
Ser Glu Pro Ser Asn Lys Arg Val Lys Pro Leu Ser Arg Val Thr Ser 

-25 -20 -15 

cta gca aac ctc atc ccg ccc gtg aag gcc ayg cca tta aag cgc ttc 248 

Leu Ala Asn Leu Ile Pro Pro Val Lys Ala Xaa Pro Leu Lys Arg Phe 
-10 -5 1 5 

agt caa acc ctg cag cgc tcc att agc ttc cgc agt gag agt cgc cct 296 
Ser Gln Thr Leu Gln Arg Ser Ile Ser Phe Arg Ser Glu Ser Arg Pro 

10 15 20 

gac atc ctc gcc ccc cga ccc tgg tcc aga aat g 330 
Asp Ile Leu Ala Pro Arg Pro Trp Ser Arg Asn 

25 30 

<210> 403 
<211> 311 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 168..311 

<221> sig_peptide 
<222> 168..227 
<223> Von Heijne matrix 

score 4.90000009536743 

seq CILISTAFPSLLT/QI 

<400> 403 

tgagcagatg gtgccaggat ttaaacctat gtttatcaga tgcagatgac ccaaacagtg 60 
gcttatctgt tggtaatttt tatttagatc aagttaaaca taaatgactt tgcattactc 120 
tttggtcact ttttcctagt catttcaaat agtctgtctt atttctc atg gtt ttt 176 

Met Val Phe 
-20 

tgg aca aaa ttt tgt att tta att agt aca gca ttt cct tct tta ttg 224 
Trp Thr Lys Phe Cys Ile Leu Ile Ser Thr Ala Phe Pro Ser Leu Leu 

-15 -10 -5 

aca cag att att ttc cct aaa tct att aca ttt gct ttc cag ttt ttc 272 
Thr Gln Ile Ile Phe Pro Lys Ser Ile Thr Phe Ala Phe Gln Phe Phe 

1 5 10 15 

tgg aac agg gaa aaa caa aaa aca aaa aca cca act ggg 311 
Trp Asn Arg Glu Lys Gln Lys Thr Lys Thr Pro Thr Gly 
20 25 

<210> 404 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80..274 

<221> sig_peptide 

<222> 80..190 

<223> Von Heijne matrix 

score 4.90000009536743 

seq MLIMLGIFFNVHS/AV 

<400> 404 

ccctgcgagg gcatcctggg ctttctccca ccgctttccg agcccgcttg cacctcggcg 60 
atccccgact cccttcttt atg gcg tcg ctc ctg tgc tgt ggg ccg aag ctg 112 

Met Ala Ser Leu Leu Cys Cys Gly Pro Lys Leu 
-35 -30 

gcc gcc tgc ggc atc gtc ctc agc gcc tgg gga gtg atc atg ttg ata 160 
Ala Ala Cys Gly Ile Val Leu Ser Ala Trp Gly Val Ile Met Leu Ile 
-25 -20 -15 

atg ctc gga ata ttt ttc aat gtc cat tcc gct gtg ttg att gag gac 208 
Met Leu Gly Ile Phe Phe Asn Val His Ser Ala Val Leu Ile Glu Asp 
-10 -5 1 5 

gtt ccc ttc acg gag aaa gat ttt gag aat ggc ccc cag aac ata tac 256 
Val Pro Phe Thr Glu Lys Asp Phe Glu Asn Gly Pro Gln Asn Ile Tyr 
10 15 20 

aac ctt tac gag cat ggg 274 
Asn Leu Tyr Glu His Gly 
25 

<210> 405 
<211> 153 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 69..152 

<221> sig_peptide 

<222> 69..116 

<223> Von Heijne matrix 

score 4.90000009536743 

seq SALLLEXLQXAIP/RX 

<400> 405 

tttcccctgc cctgtcctct cattcccctt cttctggagc atttcatcca cagacccctt 60 
gcccaaga atg tct gtc tca gct ctg ctt cta gag mtc ctc caa gmt gcc 110 
Met Ser Val Ser Ala Leu Leu Leu Glu Xaa Leu Gln Xaa Ala 
-15 -10 -5 

atc cct cgy mam acc tca ggc ttm caa gac ctg ccc aac tgg g 153 
Ile Pro Arg Xaa Thr Ser Gly Xaa Gln Asp Leu Pro Asn Trp 
1 5 10 

<210> 406 
<211> 206 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57..206 

<221> sig_peptide 

<222> 57..173 

<223> Von Heijne matrix 

score 4.90000009536743 

seq VIAIVSFTTLCSS/LY 

<400> 406 

aaataaaaaa tattaaaaaa taatctcatc tttgatttta gatttagggg gtgtgc atg 59 

Met 

cag gct tgt tat atg ggt atg tgg tat act gcc gag gct tgg ggt acg 107 
Gln Ala Cys Tyr Met Gly Met Trp Tyr Thr Ala Glu Ala Trp Gly Thr 

-35 -30 -25 

att gag tcc ctc acc cag gta gtg agc gta atc gca ata gtt agt ttt 155 
Ile Glu Ser Leu Thr Gln Val Val Ser Val Ile Ala Ile Val Ser Phe 

-20 -15 -10 

aca acc ctg tgc tcc tct ctg tat tcc ccc caa gta gtc ccc agt gtt 203 
Thr Thr Leu Cys Ser Ser Leu Tyr Ser Pro Gln Val Val Pro Ser Val 
-5 1 5 10 

ggg 206 

Gly 

<210> 407 

<211> 479 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 277..477 

<221> sig_peptide 
<222> 277..462 
<223> Von Heijne matrix 

score 4.90000009536743 

seq PLAACPLLLPIFS/HA 

<221> misc_feature 
<222> 22 

<223> n=a, g, c or t 



<400> 407 

aatggattga gatggggaag anaaaaagcc ccaaattcat gaaatgtagc tgckacagtc 60 
cccacctcct tagctgtccc caaaacctaa gcaggtaatc ataacttcca ttctgtgctc 120 
accttacctc tgctggcacc tttttggaca gggttctcta cttggcgagg tgacccaaat 180 
cttcattcct gcagggtctg agtcctmrgc cgctgcgatc gtttgaacat tgtttgtccc 240 
cacmraaact catcttgagg cttggtcccc actgta atg atg ttg aga ggt ggc 294 

Met Met Leu Arg Gly Gly 
-60 

ggg aca ttt aag grg tgt ttg agt cat gag gga tcc agc ttc acg aag 342 
Gly Thr Phe Lys Xaa Cys Leu Ser His Glu Gly Ser Ser Phe Thr Lys 

-55 -50 -45 

gga tta gcg cag gag tgc gtg agt rct tct tgt ggg act cga ttg att 390 
Gly Leu Ala Gln Glu Cys Val Ser Xaa Ser Cys Gly Thr Arg Leu Ile 
-40 -35 -30 -25 

act gca gtw gcc agt kgt tac aaa gca agg ctg cct ctg gcc gcg tgc 438 
Thr Ala Val Ala Ser Xaa Tyr Lys Ala Arg Leu Pro Leu Ala Ala Cys 

-20 -15 -10 

ccd ctt ctg ctt cct att ttc tcc cat gct aga agc agc ac 479 
Pro Leu Leu Leu Pro Ile Phe Ser His Ala Arg Ser Ser 
-5 1 5 



<210> 408 
<211> 289 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 84..287 

<221> sig_peptide 

<222> 84..203 

<223> Von Heijne matrix 

score 4.90000009536743 
seq SLKICGLVFGILA/LT 



<400> 408 

agccgactca cttgcaactc cacctcagca gtggtctctc agtcctctca aagcaaggaa 60 
agagtactgt gtgctgagag acc atg gca aag aat cct cca gag aat tgt gaa 113 

Met Ala Lys Asn Pro Pro Glu Asn Cys Glu 
-40 -35 

gac tgt cac att cta aat gca gaa gct ttt aaa tcc aag aaa ata tgt 161 
Asp Cys His Ile Leu Asn Ala Glu Ala Phe Lys Ser Lys Lys Ile Cys 
-30 -25 -20 -15 

aaa tca ctt aag att tgt gga ctg gtg ttt ggt atc ctg gcc cta act 209 
Lys Ser Leu Lys Ile Cys Gly Leu Val Phe Gly Ile Leu Ala Leu Thr 
-10 -5 1 

cta att gtc ctg ttt tgg ggg agc aag cac ttc tgg ccg gag gta ccc 257 
Leu Ile Val Leu Phe Trp Gly Ser Lys His Phe Trp Pro Glu Val Pro 
5 10 15 

aaa aaa gcc tat gac atg gag cac act acg gg 289 
Lys Lys Ala Tyr Asp Met Glu His Thr Thr 
20 25 

<210> 409 
<211> 341 



